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Preface 



Recent advances in sensing, communication and computation technologies have 
enabled a group of agents, such as robots, to communicate or sense their relative 
information and to perform tasks in a collaborative fashion. The past few years 
witnessed rapidly-growing research in cooperative control technology. Applications 
range from space interferometry sensing to environmental monitoring, to distributed 
computing, and distributed target surveillance and tracking. However, the analytical 
techniques used in cooperative control algorithms have been disparate, and a unified 
theory has been wanting. 

In this book, we present a passivity-based framework that allows a systematic de- 
sign of scalable and decentralized cooperative control laws. This framework makes 
explicit the passivity properties used implicitly in existing results and simplifies 
the design and analysis of a complex network of agents by exploiting the network 
structure and inherent passivity properties of agent dynamics. As we demonstrate 
in the book, this passivity-based framework can be easily tailored to handle classes 
of cooperative control problems. Each of these problems has important applica- 
tions in practice. For example, formation control and coordinated path following 
allow vehicles to maintain a tight relative configuration, thereby achieving effective 
group sensing or drag force reduction. Attitude coordination ensures that multiple 
spacecraft keep a precise relative attitude, which is important in space interferom- 
etry missions. Cooperative load transport enables robots to move an object around 
meanwhile exerting desired internal forces on the object. 

To demonstrate the design flexibility offered by the passivity-based framework, 
we feature various adaptive redesigns. Adaptivity is important in multi-agent sys- 
tems, as it implies that the agents can adjust their control according to changing 
conditions. A particularly useful scenario is when a group leader has the group mis- 
sion plan while the remaining agents have only partial information of the plan. As 
the leader autonomously modifies the mission plan, it is essential that the rest of the 
group be able to adapt. We develop adaptive designs with which the agents recover 
the leader's mission plan. Another scenario that requires adaptivity is when uncer- 
tainty exists in agents' controls, such as wind and viscous damping. In this case, the 
agents should adapt their control laws to simultaneously accommodate this uncer- 
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tainty and achieve the required task. We illustrate this scenario with adaptive designs 
for agreement of multiple Euler-Lagrange systems. 

The intention of this book is to summarize in a coherent manner the authors' re- 
cent research in passivity-based cooperative control of multi-agent systems. Related 
passivity approaches are applicable to other interconnected systems, such as data 
communication networks, biomolecular systems, building heating, ventilation, and 
air conditioning (HVAC) systems, and power networks, though these applications 
are outside the scope of this book. The organization of this book is as follows: 
Chapter 1 introduces cooperative control and presents the necessary background in 
graph theory and passivity. 

Chapter 2 presents the passivity-based approach to cooperative control and applies it 
to the agreement problem. We illustrate a position-based formation control problem 
that can be transformed into the agreement problem. We also study a distance-based 
formation control under the same passivity-based framework and compare it with 
the position-based formation control. 

Chapters 3 and 4 consider the situation where only one agent possesses the group 
reference velocity information and develop adaptive designs that recover such in- 
formation for the other agents. Chapter 3 adopts an internal model approach while 
Chapter 4 parameterizes the reference velocity with time-varying basis functions. 
These two adaptive schemes illustrate the design flexibility offered by the passivity- 
based approach. 

Chapter 5 investigates attitude agreement of multiple rigid bodies. We adapt the 
results in Chapters 2 and 4 to a similar passivity-based framework that addresses 
the agreement of agents in SO(3). 

Chapter 6 studies agreement problem of multiple Euler-Lagrange systems. In partic- 
ular, we consider the case where uncertainty exists in agents' dynamics. We present 
two adaptive designs that ensure the agreement in the presence of uncertainty. 
Chapter 7 presents a synchronized path following problem, where the agents achieve 
tracking of a desired formation by synchronizing path parameters. The synchroniza- 
tion of the path parameters is solved by the passivity-based agreement designs in 
Chapter 2. 

Chapter 8 studies cooperatively transporting a common load by multiple robots. We 
formulate this problem in a similar fashion to the position-based formation control 
in Chapter 2 and validate it with experiments. 

Chapter 9 provides an investigation of robustness properties for a second order linear 
cooperative control system. 
The Appendix includes technical proofs and tools utilized in this book. 

The results in this book would not have been possible without the help of our 
colleagues and collaborators. We are indebted to Professors Petar Kokotovic, Pana- 
giotis Tsiotras, Derek Paley, Ming Cao, Arthur Sanderson, and A. Agung Julius 
for insightful comments on parts of this work. We also acknowledge Emrah Biyik, 
Ivar- Andre Ihle and Thor Fossen, whose joint work with the second author are pre- 
sented in Section 4.5 and in Chapter 7. We would like to express our gratitude 
to Mehran Mesbahi, Wei Ren, Fumin Zhang and Sam Coogan for reviewing parts 
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of the manuscript and for providing valuable feedback. We also thank our editors 
Steven Elliot and Andrew Leigh for their interest and efforts in publishing this book. 
The first author is grateful to Professors Kevin Lynch and Randy Freeman for of- 
fering him a precious opportunity to continue working on cooperative control as 
a research fellow at Northwestern University and for their constant guidance and 
support. He also thanks John Wason and Karen Chapin for help in experimental 
implementation, and Aranya Chakrabortty, Fabio Morbidi, Florian Dorfler, Chinpei 
Tang, and Tiejun Zhang for helpful discussions during the preparation of the book. 
We acknowledge the financial support by the Air Force Office of Scientific Re- 
search under grants FA9550-09- 1-0092 and FA9550-07-1-0308 and by the National 
Science Foundation under grant ECCS-0852750. This work was also supported 
partly by the Center for Automation Technologies and Systems (CATS) at Rens- 
selaer Polytechnic Institute under a block grant from the New York State Office of 
Science, Technology, and Academic Research (NYSTAR). 

He Bai UtopiaCompression Corporation, Los Angeles, California 
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Chapter 1 
Introduction 



1.1 What Is Cooperative Control? 

Multiple robots or sensors have a number of advantages over a single agent, in- 
cluding robustness to failures of individual agents, reconfigurability, and the ability 
to perform challenging tasks such as environmental monitoring, target localization, 
that cannot be achieved by a single agent. A cooperative control system consists of a 
group of autonomous agents with sensing or communication capabilities, for exam- 
ple, robots with camera sensors and vehicles with communication devices. The goal 
of the group is to achieve prescribed agent and group behaviors using only local 
information available to each agent from sensing or communication devices. Such 
local information may include relative configuration and motion obtained from sens- 
ing or communication between agents, agent's sensor measurements, and so on. The 
relative sensing and communication dictates the architecture of information flow be- 
tween agents. Thus, a cooperative system has four basic elements: group objective, 
agents, information topology, and control algorithms governing the motion of the 
agents. 

Examples of group objectives in cooperative control include flocking, schooling, 
cohesion, guarding, escorting [112, 98, 131, 76, 51, 107], agreement [110, 75, 102, 
63, 10, 103, 5], vehicle formation maintenance [124, 47, 36, 5], gradient climbing 
in an environmental field [8, 98, 27, 18], cooperative load transport [14, 145, 106, 
129, 128], distributed estimation and optimal sensing [97, 99, 86, 153, 33], source 
seeking [151] and coverage control [34, 78]. Some of the cooperative control objec- 
tives involve only relative variables (e.g., relative positions) between agents while 
others depend on absolute variables (e.g., inertial positions) of agents. We illustrate 
below some examples of cooperative control. 

Formation Control. A major focus in cooperative control is formation stability, 
where the group objective is to stabilize the relative distances/positions between 
the agents to prescribed desired values. Formation maintenance finds natural ap- 
plications in coverage control, drag force reduction, and space interferometry. We 
take space interferometry as a motivating example. As shown in Fig. 1.1, space in- 
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Fig. 1.1 Space interferometry using multiple spacecraft. Spacecraft must maintain precise relative 
formation and the same attitude towards the incoming light to generate an interferometry pattern. 
The information flow between spacecraft is setup by optical sensors, which measure relative posi- 
tions between spacecraft. 



terferometry uses multiple spacecraft pointing towards an object of interest. Each 
spacecraft collects light from the object. If the relative positions between spacecraft 
are maintained precisely, an interferometry pattern is generated and measurement 
of the magnitude and phase of this pattern can be obtained by coherently mixing the 
light from individual spacecraft [73, 118, 123]. Multiple such measurements allow 
reconstruction of the image of the object, which has a much finer resolution than 
any single spaceborne telescope achieves. In space, position information of individ- 
ual spacecraft in the earth frame is imprecise or unavailable, whereas relative po- 
sition between spacecraft can be measured precisely by optical sensors [74]. Thus, 
maintaining spacecraft formation in space may make use of only relative position 
information while maneuvering the formation to point towards objects of interest. 

Agreement. In the agreement problem, the group objective is the convergence of 
distributed variables of interest (agents' positions or headings, phase of oscillations, 
etc.) to a common value. Given different contexts, the agreement problem is also 
called consensus, or synchronization. As shown in Fig. 1.1, space interferometry 
requires spacecraft to align their attitudes with each other, which is an agreement 
problem. Agreement problem also has potential applications in schooling and flock- 
ing in distributed robotics and biological systems [100, 112, 127, 24], distributed 
estimation and sensor fusion [99, 104], fire surveillance [25] and distributed com- 
puting [148, 140, 16], among others. 

Optimal Sensing. The group objective for optimal sensing is to optimally place 
the agents' positions so that certain meaningful utility functions are maximized. 
Examples of utility functions include probability of detecting a target [34, 78] and 
information obtained from a sensor network [86, 33]. In this case, the utility func- 
tions usually depend on the absolute positions of the agents. 

Most cooperative control problems concern coordinated motion of agents in dif- 
ferent scenarios. Therefore, agent dynamics become important in achieving differ- 
ent group objectives. Small mobile sensors or robots can be controlled by directly 
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manipulating their velocities. Such agents are commonly modeled by a first order 
kinematic model. Depending on the group objective, the agent model can be a sim- 
ple integrator or a set of integrators subject to nonholonomic constraints, such as 
a unicycle model. If the group objective is to control the position of the sensor to 
improve sensing capability, it may suffice to model the sensors as massless points 
with single integrator kinematics. When the velocities of agents are not directly ma- 
nipulatable or the masses of the agents are significant, double integrator agent dy- 
namics are more appropriate. In numerous applications, the attitudes of the agents 
play an important role, which means that agent models in Euler-Lagrangian form or 
in Hamiltonian form may be required. 

To achieve the group objective, each agent may need information from other 
agents. If agent i has access to agent y's information, the information of agent j 
flows to agent i and agent j is a neighbor of agent i. The abstract structure of the 
information flows in the group is then represented as a graph, where each agent is 
a node and the information flows between them are represented as links ' . In many 
applications, the information flow between agents is achieved by direct sensing or 
communication. For example, to control the relative distances between the agents, 
the agents obtain their relative distance information by sensing or by communicat- 
ing their inertial positions. In some applications, the information flow is realized 
through a physical medium. For example, consider a group of agents transporting 
a payload. By interacting with the payload, the agents can obtain the relative infor- 
mation between them without explicit communication. We will illustrate such an 
example in Chapter 8. 

One of the main challenges in cooperative control design is to achieve prescribed 
group objectives by distributed feedback laws. The distributed laws make use of 
information available only to individual agents. Such information includes the in- 
formation flow from neighbors, and sensor measurements from agent itself. Take 
the agreement problem as an example. When modeled as a first order integrator, the 
agent can aggregate the differences between its own state and its neighbors' and take 
that aggregated value as a feedback control. In this case, the control algorithm is dis- 
tributed since it only employs information from neighboring agents. If the control 
algorithm and the agent model are both linear, stability can be analyzed by examin- 
ing the eigenvalues of the closed-loop system matrix with the help of algebraic graph 
theory. This approach leads to simple stability criteria for the agreement problem, 
e.g., [63, 111, 102, 103, 47, 76, 87]. 

However, for some applications of cooperative control, only nonlinear algorithms 
can achieve the objective. Consider the following formation control problem: The 
group objective is to stabilize relative distances (the Euclidean norms of relative 
positions) between agents to desired values. In this case, the desired equilibria are 
spheres, which are compact sets containing more than one point. When each agent 
is modeled as a linear system, such as a double integrator, there is no linear feed- 
back law globally stabilizing the desired equilibria. This is because a linear agent 
model with linear feedback results in a linear system, whose equilibria can simply 
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be a point or a subspace. Thus, only nonlinear feedback laws may solve this forma- 
tion control problem. Indeed, most of the formation control algorithms have been 
proposed in the form of nonlinear artificial attraction and repulsion forces between 
neighboring agents. The design and analysis of such rules make use of graph theory 
and potential function methods. 



1.2 What Is in This Book? 

For different cooperative control problems, there are different control design meth- 
ods. In this book, we introduce a unifying passivity-based framework for cooperative 
control problems. Under this passivity-based framework, we develop robust, adap- 
tive, and scalable design techniques that address a broad class of cooperative con- 
trol problems, including the formation control and the agreement problem discussed 
above. 

This framework makes explicit the passivity properties used implicitly in the Lya- 
punov analysis of several earlier results, including [131, 98, 102], and simplifies the 
design and analysis of a complex network of agents by exploiting the network struc- 
ture and inherent passivity properties of agent dynamics. With this simplification, 
the passivity approach further overcomes the simplifying assumptions of existing 
designs and offers numerous advantages, including: 

1. Admissibility of complex and heterogenous agent dynamics: Unlike some of 
the existing cooperative control literature where the agent is modeled as a point 
robot, the passivity approach allows high order and nonlinear dynamics, including 
Lagrangian and Hamiltonian systems. As illustrated in Chapter 5, attitude coordina- 
tion among multiple rigid bodies can be studied under this passivity framework. In 
this case, the agent dynamics are in the Hamiltonian form. Chapter 6 discusses the 
agreement of multiple Lagrangian systems. The passivity approach is further appli- 
cable to heterogenous systems in which the agent dynamics and parameters, such as 
masses, dampings, vary across the group. 

2. Design flexibility, robustness and adaptivity: The passivity approach abstracts 
the common core of several multi-agent coordination problems, such as forma- 
tion stabilization, group agreement, and attitude coordination. Because passivity 
involves only input-output variables, it has inherent robustness to unknown model 
parameters. Since passivity is closely related to Lyapunov stability, this passivity 
approach lends itself to systematic adaptive designs that enhance robustness of co- 
operative systems. Such design flexibility and adaptivity will be demonstrated in 
this book by the adaptive designs in Chapters 3, 4 and 6. 

3. Modularity and scalability: The passivity framework yields decentralized con- 
trollers which allow the agents to make decisions based on relative information 
with respect to their neighbors, such as relative distance. A key advantage of the 
passivity-based design is its modularity, which means that the control laws do not 
rely on the knowledge of number of other agents, the communication structure of 
the network, or any other global network parameters. 
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Fig. 1.2 If the sensing/communication ranges for both robots are chosen to be the same and one 
agent is within the sensing/communication range of the other agent, the information flow between 
them is symmetric. S > denotes the sensing or communication radius. 



Our major assumptions for this passivity-based approach are: 

Bidirectional Information Topology: Control algorithms with bidirectional infor- 
mation topology tend to have inherent stability properties as we explicate with the 
help of passivity arguments in this book. Although directional information topology 
can render stability for first order agents [103, 109], it may lead to instability for 
high order agents, as we illustrate in Example 2.3 in Chapter 2. 

Bidirectional information topology also appears naturally in a number of cooper- 
ative control applications. For example, as shown in Fig. 1.2, the information topol- 
ogy of agents with the same sensing range can be modeled as bidirectional. In the 
load transport problem studied in Chapter 8, the agents exert force on the pay load 
and receive reaction forces from the payload. The exerted force and the reaction 
force contain implicitly the relative motion information between the agents and the 
payload. Thus, the information flows are bidirectional. 

Static Information Topology: Our design assumes that the information topology 
remains unchanged. This is not a restrictive assumption since in most practical sit- 
uations, the information topology remains static for a certain period of time. If that 
period of time is long enough, by standard dwell time arguments [91, 56], the closed- 
loop system remains stable. Note that for first order linear consensus protocols, such 
as those studied in [103, 63], robustness to arbitrary switching topology has been 
justified. However, for higher order systems, it is well known that switching may 
lead to instability [80, 81]. We will show that for first order protocols, the passivity- 
based framework can handle a broad class of switching topology whereas for higher 
order cooperative systems, topology switching improperly may result in instability. 
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1.3 Notation and Definition 



• We denote by K and by C the set of real numbers and complex numbers, respec- 
tively. The notation K>o denotes the set of all real nonnegative numbers. The real 
part and the imaginary part of a complex number x £ C are given by Re [x] and Im [x] , 
respectively. 

• All the vectors in this book are column vectors. The set of p by 1 real vectors 
is denoted by W while the set of p by q real matrices is denoted by R px ' ? . The 
transpose of a matrix A £ R px i is given by A T £ R' ?xp . 

•jV{A) and ffl(A) are the null space (kernel) and the range space of a matrix A, 
respectively. /„ and ; , denote the px p identity and zero matrices, respectively. The 
p x q zero matrix is denoted by pxq . Likewise, 1# and 0^ denote the N x 1 vector 
with each entry of 1 and 0, respectively. Without confusion, we will also use to 
denote a vector of zeros with a compatible dimension. 

• The Kronecker product of matrices A £ R mxn and B £ W xq is defined as 



A®B:~- 



CL\ \B 



a ln B 



®m\B ' ' ' @mn*3 



vmpxnq 



(1.1) 



and satisfies the properties 



(A(g>B) T =A T (g>B T 
(A®lp)(C®l p ) = {AC)®Ip 



(1.2) 
(1.3) 



where A and C are assumed to be compatible for multiplication. 

• The maximum and minimum eigenvalues of a symmetric matrix A are denoted by 
Amax(A) and A, n ; n (A), respectively. 

• For a vector x £ W, \x\ denotes its 2-norm, that is \x\ = Vx T x. 

• The norm of a matrix A is defined as its induced norm | |A| | = \/X max {A T A). 

• We use the notation diag{/q , K2, ■ ■ ■ ,K n } to denote the block diagonal matrix 











pxq 



pxq 

K 2 



\<W0 



pxq 



Vpxq\ 



Jpxq 



K n / 



(1.4) 



where K t £ R pxq , i = 1, •••,«. 

• The notation K = K T > means that K is a symmetric positive definite matrix 
while k > implies k is a positive scalar. 

• Given a vector vet 3 , the cross product vx is a linear operator, and can be repre- 
sented in a coordinate frame as left-multiplication by the skew-symmetric matrix: 
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-v 3 


v 2 


V'3 





-vi 


-V2 


VI 






(1-5) 



where (vi, V2, V3) are the components of v in the given coordinate frame. The in- 
verse operation of cross product is given by v , that is 



(^ 



(1.6) 



• For the coordinate frame representation of a vector, the leading superscript indi- 
cates the reference frame while the subscript i denotes the agent i. The superscript 
d means the desired value. As an illustration, Jyf means the desired velocity of the 
sth agent in the jth frame. 

• A function is said to be C k if its partial derivatives exist and are continuous up to 
order k. 

• Given a C 2 function P : K p — > R we denote by WP its gradient vector, and by V 2 P 
its Hessian matrix. 

• A function a : [0,a) — > R>o is of class K if it is continuous, strictly increasing and 
satisfies a(0) = 0. It is said to belong to class Ka, if a = 00 and a(r) — » °° as r — > <». 
A function /3 : M>o X ]R>o — > K>o is of class KL if, for each fixed s, the function 
P(r,s) belongs to class K with respect to r and, for each fixed r, the function fi(r,s) 
is decreasing with respect to s and j3 (r, s) — > as s — > 00. An example of class /TL 
functions is shown in Fig. 1.3. 



r=5 

r=10 



Fig. 1.3 The function j8(r,i) = re 5s is of class ATL because for fixed ; 



5s is decreasing and 



converges to zero as .v converges to 
respect to r. 



and for fixed s, re is monotonically increasing with 



• The system i = f(x,u) is said to be Input-to-State Stable (ISS) [125, 126] if there 
exist functions /3 € KL, p E K such that for any initial state x(to) and any bounded 
input u(t), the solution x(t) exists for all t > and satisfies 
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\x(t)\<P(\x(to)\,t-t ) + p( sup |»(t)|). (1.7) 

tQ<T<t 

• For a closed set &/, \x\.tf denotes the distance from the point % to si ', defined as 

l*U= inf \X-V\. (1.8) 

tigs/ 

• Given the dynamics of the state #(?), a closed invariant set ^/ is uniformly asymp- 
totically stable with region of attraction ^ if for each e > there exists 8 > such 
that 

|*fo)U<5 =► |Z(0U<£ Vr>r (1.9) 

and, if for each e > and r > 0, there exists T > such that for every initial condi- 
tion x(to) € ^ the resulting trajectory satisfies 

|*(Jb)U<r=*- |z(0l,/<£ Vf>T. (1.10) 

Several results on set stability and, in particular, converse Lyapunov theorems are 
presented in [82] and [137]. 

• We use the notation % — ¥ <?^°° to indicate a sequence of points % in Sf converging 
to a point on the boundary of ( S, or if Sf is unbounded, having the property \%\ — > oo. 



1.4 Basic Graph Theory 

In this book, we will make use of basic result from algebraic graph theory to facil- 
itate our analysis. The results presented in this section are standard in the literature 
and will be well known to readers familiar with graph theory. 

A graph is an abstract representation of a group of nodes where some of them 
are connected by links. More formally, a graph G is an ordered pair G — (V ,E) 
consisting of a set "V of nodes and a set E C "V x "¥ of links. Thus, a link is an 
ordered pair of two distinct nodes. 

A directed link (;', j) is an incoming link to node /' and an outgoing link from node 
/. We then draw an arrow from node i to node j. We call node i (respectively, j) the 
negative (respectively, positive) end of link (/, /'). If both links (/, /') and (j, i) belong 
to E, we combine these two links as one undirected link and use a bidirectional 
arrow to denote this link. 

Depending on the directions of the links, a graph may be categorized as directed 
or undirected. If a graph G consists of only undirected links, it is undirected. Other- 
wise, the graph is directed. 

We say node i is a neighbor of node j if the link (;', j) exists in the graph G. This 
means that for each directional link, the negative end is the neighbor of the positive 
end. Note that for undirected graphs, if node i is a neighbor of node j, then node j 
is also a neighbor of node i. We denote by jYj the set of neighbors of node j. 
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o 






Fig. 1.4 Different types of graphs of five nodes, (a): an undirected connected graph, (b): a balanced 
and strongly connected graph, (c): a strongly connected graph, (d): a weakly connected graph. A 
directed link is denoted by a line with a directional arrow while an undirected link is denoted by a 
bidirectional arrow. The node number is beside each node. 



For i £ ~f, if the number of incoming links to i is the same as the number of 
outgoing links from i, the graph is balanced. Clearly, an undirected graph is a special 
balanced graph. 

A directed path is a sequence of p nodes 1, •••,/», such that (i,i+ 1) £ E, 
\/i = 1 , • • • , p — 1. A cycle is a directed path such that the starting and the ending 
nodes of the path are the same. A graph is called strongly connected if there exists 
a directed path from any one node to another. Note that for an undirected graph, 
strong connectedness is simply termed connectedness. A graph is called weakly 
connected if replacing all the directed links in E with undirected ones gives a con- 
nected undirected graph. In Fig. 1.4 are several examples of five nodes illustrating 
connectedness of different graphs. 

Definition 1.1 (Graph Laplacian matrix L). 

Consider a directed graph G with N nodes. The Laplacian matrix of a graph G, 
denoted by L £ M. NxN , is given by 



til ■= 



-1 ifj£,A 

otherwise, 



(1.11) 



where \jV[ | is the cardinality of the set jYi. 

The definition in (1.11) results in the following property of L: 
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Property 1.1, The graph Laplacian matrix L has an eigenvalue of zero associated 
with an eigenvector 1#, i.e., IA^ = On- □ 

Example 1,1, Following (1.11), we compute the graph Laplacian matrices for the 
graphs in Fig. 1.4 as 



U 



( 2 -1 -1 \ 

-12-10 

0-12-10 

-10-13 -1 

\ -1 1 / 

( 1 0-10 \ 
-110 
0-12-10 
-10 2-1 

\ -1 1 / 



( 1 -1 \ 
-110 
0-1100 
0-12-1 

\ o o o-iiy 



(1.12) 



/ 2 -1 -1 0\ 
-12-100 
and L d = 0-12-10 
-10-120 

\ 0-11/ 

It is easy to see all these four Laplacian matrices satisfy Property 1.1. 



(1.13) 



□ 



In particular, the Laplacian matrix for undirected graphs satisfies Properties 1.2 
and 1.3 below. 

Property 1.2. The Laplacian matrix L of an undirected graph is symmetric and pos- 
itive semidefinite. □ 

Property 1.3. [17, Item 4e and Corollary 6.5] 

An undirected graph is connected if and only if the second smallest eigenvalue 
of its Laplacian matrix is strictly positive. □ 

We verify the positive semidefiniteness of L in Property 1.2 by showing 



y T Ly>0 VyeR N . 
To see this, we lety, be the ith element of y and note from (1.11) that 

N N 

y T Ly = Y,y> E (yt-yj) = 1 L Cy?-w) 

= iL(y>-yj) 2 +iL(y>y J -y 2 J )- 

i=l;'G.y(f i=lje^Vj 

Because the graph is undirected, we have 

N N 

LLy^LLy 2 



(1.14) 

(1.15) 
(1.16) 
(1.17) 



(1.18) 
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which implies that the last term in (1.17) is indeed — y T Ly. Therefore, it follows 
from (1.17) that 

y T Ly = ^tL(y-yj) 2 >°- d- 19 ) 

For a general directed graph, the graph Laplacian matrix L is not symmetric and 
y T Ly can be sign-indefinite. However, if the directed graph is balanced and strongly 
connected, y 1 Ly > holds for any y due to the following property: 

Property 1.4. [103] The graph Laplacian matrix L of a balanced and strongly con- 
nected graph G satisfies 



L + L T = -L sym (1.20) 



where L sym represents the graph Laplacian matrix of the undirected graph obtained 
by replacing the directed edges in G with undirected ones. □ 

For an undirected graph G, we may assign an orientation to G by considering one 
of the two nodes of a link to be the positive end. We denote by J^ + (J§f)~) the set of 
links for which node (' is the positive (negative) end. 

Definition 1.2 (Graph Incidence matrix D). 

Denoting by £ the total number of links, we define the N x £ incidence matrix D 
of an undirected graph G as 

-lif*€J2I + 
d ik := { -1 if k&5?r (1.21) 

otherwise. 

□ 

Property 1.5. We obtain from (1.21) an incidence matrix D corresponding to a par- 
ticular orientation assignment to the undirected graph G. Independently of how we 
assign the orientation to G, the resulting incidence matrix D has the following prop- 
erties: 

1 . The rank of D is at most N — 1 and the rank of D is N — 1 if and only if the graph 
G is connected; 

2. The columns of D are linearly independent when no cycles exist in the graph; 

3. If the graph G is connected, the only null space of D T is spanned by 1^; 

4. The graph Laplacian matrix L of G satisfies 

L = DD T . (1.22) 

□ 

Example 1.2. We verify the last item in Property 1.5 by considering the graph G in 
Fig. 1.5. We obtain from (1.11) that 
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Fig. 1.5 An undirected graph of four agents whose Laplacian matrix is in ( 1 .23). The agent number 
is illustrated at each node. 





(a) (b) 

Fig. 1.6 Two different orientation assignments to the graph in Fig. 1.5 yields two different graph 
incidence matrices D in (1.24). However, both incidence matrices give the same Laplacian matrix 
(1.23) using (1.22). The arrow points to the positive end of each link. The link number is denoted 
in italic at each link. 



/ 1 -1 \ 

-1 3 -1 -1 

0-12-1 

\ -1 -1 2 ) 



(1.23) 



To show that the choice of D does not affect L, we assign different orientations to G 
as in Fig. 1.6 and obtain the two incidence matrices D as 



D„ 



/-l \ 

1 -1-10 

1-1 

\ 1 1 ) 



and Db 



/ 1 \ 

-1-11 
10 1 

V o o -l -1/ 



(1.24) 



A simple computation shows that L = D u D T a — D/,Dj } . Thus, the choice of orien- 
tation assignment to the graph does not affect the graph Laplacian matrix. □ 
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1.5 Passivity and Passivity-preserving Structures 

In this section, we briefly review the definition of passivity and its relation to stabil- 
ity. We also present four passivity-preserving structures that will be utilized in the 
rest of the book. Some of the results in this section are based on [69, 116]. 

Definition 1.3 (Passivity of Static Nonlinearity). 

A static nonlinearity y = h(u), where h : MP — * W, is passive if, for all u £ M. p , 

u T y = u T h(u)>0; (1.25) 

and strictly passive if ( 1 .25) holds with strict inequality Vm ^ 0. □ 

Definition 1.4 (Passivity and Strict Passivity of Dynamical Systems). 

The dynamical system 

•*' \y=h^,u), l;£W,u,y£W, {l - M> 

is said to be passive if there exists a C 1 storage function S(£ ) > such that 

S = VS(l;) T f(i;,u)<-W(!;) + u T y (1.27) 

for some positive semidefinite function W(%). We say that (1.26) is strictly passive 
ifW(^) is positive definite. □ 

Definition 1.5 (Strict Input and Output Passivity). 

For the dynamic system (1.26), if S in (1.27) satisfies 

S<-u T \j/(u) + u T y (1.28) 

for some function \j/(u) such that u T \]/(u) > 0, then (1.26) is input strictly passive. 
Likewise, if 

$<-y T Y(y)+u T y (1.29) 

holds for some function y(y) where y T y/(y) > 0, (1.26) is output strictly passive. 

□ 

Example 1.3 (Passivity of Euler-Lagrange Systems). 

A standard model of mechanical systems with n degrees of freedom is given by 
the Euler-Lagrange equation: 

where x— [xi,---,x„] T are the generalized coordinates of the system and T = 
[ti , • • • , T„] T is the generalized torque acting on the system. The Lagrangian function 
L(x,x) satisfies 
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L(x,x)=K(x,x)-P(x) (1.31) 

where P(x) is the potential energy of the system and is bounded from below, i.e., 

P(x)>P:=mmP(x), (1.32) 

X 

and K(x,x) is the kinetic energy of the system which is assumed to be of the form 

1 T 

K(x,x) = -x T M{x)x, (1.33) 

in which M(x) = M(x) T is the positive definite generalized inertia matrix. 
A further computation from (1.30) and (1.33) leads to 



M(x) x + C(x, x)x + g (x) = z (1.34) 

well 
symmetric [6, 71], i.e. 



where g(x) = Jf . A well known property of (1.34) is that M(x) — 2C(x,i) is skew- 



M(x) -2C(x,x) = -(M(x) -2C{x,x)) 1 . (1.35) 

The Euler-Lagrange system (1.34) is passive from the generalized torque input T 
to the generalized velocity x. Such a result is established by using (1.35) and taking 
the total energy of the system V = K(x,x) +P(x) — P as the storage function. The 
derivative of V is given by 

V = x T M(x)x+ -x T M(x)x + g(x) (1.36) 

= x T T+-x T (M(x)-2C(x,x))x (1.37) 



If T is chosen as 



x't. (1.38) 



z = -Rx+T e , R = R T >0, (1.39) 



we immediately verify the strict output passivity from x e to x. □ 

Passivity of a linear time invariant dynamic system is closely related to positive 
realness of the transfer function of that system. 

Definition 1.6. [Positive Realness] 

A scalar transfer function g(s) is called positive real if 

• poles of g(s) have nonpositive real parts; 

• for all (0 £ K for which jco is not a pole of g(s), Re[g(ja>)] > 0; 

• any pure imaginary pole jco of g(s) is a simple pole and the associated residues 
are nonnegative. □ 
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The second condition in Definition 1.6 means that the Nyquist plot of g(jco) lies 
in the closed right-half complex plane, which implies that the phase shift of g(s) 
cannot exceed ±90°. 

Definition 1.7. [Strict Positive Realness [61, 142]] 

A transfer function g(s) is called strictly positive real if g(s — e) is positive real 
for some £ > 0. □ 

The strict positive realness of g(s) can also be characterized in the following 
lemma: 

Lemma 1.1. A scalar transfer function g(s) is strictly positive real if and only if 

• poles of g(s) have negative real parts; 

• for all ffl£l, Re[g(jco)} > 0; 

• either g(°°) > or g(°°) = and lim^^so CQ 2 Re[g(jco)} > 0. □ 

Example 1.4. The first-order integrator g(s) = y is positive real since it has a simple 
pole at CO — 0, associated with residue 1, and 



Re 



1 



= 0, Vffl^O. (1.40) 



The second-order integrator g(s) = 4 is not positive real since the phase shift of 
g0)is-180°. 

The transfer function g(s) = -^— for a, c > is strictly positive real since g(s — e) 
is positive real for £ = c/a > 0. □ 

When a transfer function g(s) is realized by a minimal state space representation 

[y = c$+Du, 

the positive realness of g(s) means that (1.41) is passive. 

Lemma 1.2. Let M' in (1.41) be a minimal state space representation ofg(s). Then, 

• Ji? is passive ifg{s) is positive real; 

• Ji? is strictly passive ifg(s) is strictly positive real. □ 

The passivity property of a dynamical system remains unchanged when the input 
and output variables are transformed in a "symmetric" fashion as in Fig. 1.7. 

Structure 1 (Symmetric Input-Output Transformation) Let the system H in Fig. 
1.7 be passive and let E be a matrix with a compatible dimension. Then the system 
in Fig. 1.7 is passive from utoy. □ 

Proof. Note that u T y — (E T u) T y — u T y. Thus, the passivity from u to y translates to 
the passivity from u to y. □ 
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H 



Fig. 1.7 Pre- and post- multiplication of a matrix and its transpose preserves the passivity of H. 




Fig. 1.8 Parallel interconnection of two passive systems. 



The definition of passivity closely relates to the stability of (1.26) when u = 0. In 
fact, when the storage function S is positive definite, (1.27) implies that for u = 0, 



S<-W(%)<0. 



(1.42) 



Assume that /(0, 0) = 0. Using standard Lyapunov arguments, we conclude that the 
unforced system | = /(<*,0) has a stable equilibrium E, = 0. If, in addition, W{t,) 
is positive definite, ^ = is asymptotically stable. If S is also proper, i.e., S(^) — ► °° 
as | ^ | — > oo, the asymptotic stability of <^ = is global. 

The stability properties are preserved if two or more passive systems are intercon- 
nected properly. Among all possible passivity-preserving structures, the following 
three structures are employed in our cooperative control design. 

Structure 2 (Parallel Interconnection) Consider the parallel interconnection of 
two passive systems H\ and H2 in Fig. 1.8. Then the interconnected system is pas- 
sive from u to y. □ 

Structure 3 (Negative Feedback Interconnection) Consider the negative feedback 
interconnection of two passive systems Hi and H2 in Fig. 1.9. Then the intercon- 
nected system is passive from u to y. □ 

Replacing H\ in Structure 3 with Structure 1, we obtain Structure 4 below: 

Structure 4 (Symmetric Interconnection) Consider the interconnection structure 
of two passive systems H\ and H2 in Fig. 1.10. Then the interconnected system is 
passive from u to y. □ 

We will demonstrate in the next chapter that Structure 4 arises naturally in co- 
operative control with bidirectional information flow. In particular, the matrices E 
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Fig. 1.9 Negative feedback interconnection of two passive systems. 
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Fig. 1.10 Symmetric Interconnection of two passive systems Hi and H2 is still passive. 



and E T are dictated by the undirected information topology between the agents. The 
H\ and H2 blocks in Structure 4, being block diagonal, represent the dynamics of 
individual agents and their relative configuration, respectively. We will then apply 
passivation designs to Hi and H2 such that the closed-loop stability is guaranteed by 
Structure 4. 



Chapter 2 

Passivity As a Design Tool for Cooperative 

Control 



2.1 Introduction 

In this chapter, we formulate a coordination problem that is applicable to formation 
stabilization and group agreement as special cases, and present a class of feedback 
laws that solve this problem with local information. A key observation is that bidi- 
rectional communication gives rise to Structure 4 in Section 1.5, which guarantees 
that the resulting feedback loop will inherit the passivity properties of its compo- 
nents. By exploiting this structure, we develop a design method which results in a 
broad class of feedback laws that achieve passivity and, thus, stability of the inter- 
connected system. The passivity approach also leads to a systematic construction of 
a Lyapunov function in the form of a sum of storage functions for the subsystems. 
As detailed in this chapter, several existing feedback rules for formation stability 
and group agreement become special cases in the passivity framework. 

The coordination task studied in this chapter is to steer the differences between 
the output variables of group members to a prescribed compact set. This compact 
set may be a sphere when the outputs are positions of vehicles that must maintain 
a given distance in a formation, or the origin if the outputs are variables that must 
reach an agreement across the group. We thus formulate this task as a set stability 
problem and use passivity as a tool for constructing a stabilizing feedback law and 
a Lyapunov function with respect to this set. We prove global asymptotic stability 
with additional assumptions that guarantee appropriate detectability properties for 
trajectories away from the set. 



2.2 Problem Statement 

Consider a group of N agents, where each agent i = 1, — ,JV, is represented by a 
vector Xi £ M. p that consists of variables to be coordinated with the rest of the group. 
The topology of information exchange between these agents is modeled as a graph 
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G. Since the information flow between neighbors is assumed to be bidirectional, G is 
an undirected graph. We also assume that G is connected and that G has £ undirected 
links. To simplify the analysis, we assign an orientation to G by considering one of 
the nodes to be the positive end of the link. As discussed in Section 1 .4, the choice of 
orientation does not change the results because of the symmetric information flow. 
The objective is to develop coordination laws that are implementable with local 
information (agent i can use the information of agent j if agent j is a neighbor) and 
that guarantee the following two group behaviors: 

Alj Each agent achieves in the limit a common velocity vector v(t) £ K p prescribed 
for the group; that is 

lim|i,-v(f)| =0, i=l,---,N; (2.1) 

A2) If agents i and j are connected by link k, then the difference variable Zk 

*'-=%^={X-%?£% (2 - 2) 

converges to a prescribed compact set s^k C K p , k — 1 , • • • ,£, where d^ is defined in 
(1.21). 

The reference velocity v(t) can be considered as a "mission plan" of the group. 
By specifying different v(t), we achieve different group motions, such as rotational 
and translational motions. Examples of target sets ja4 include the origin if xt's are 
variables that must reach an agreement within the group, or a sphere in W if x,'s are 
positions of vehicles that must maintain a prescribed distance. Objectives A1-A2 
may be employed to design and stabilize a formation of vehicles, or to synchronize 
variables in a distributed network of satellites, etc. 

We introduce the concatenated vectors 

x:=[x T 1 ,---,x T N } T eR pN z:=[z T l ,---,zJ] T £R pi . (2.3) 

We partition D in terms of columns vectors, i.e., 

D=[Di\---\D e ] (2.4) 

and note from (2.2) that 

z k = (D T k ®I p )x. (2.5) 

Concatenating Zk's together, we have 

z={D T ®I p )x (2.6) 

which means that z is restricted to be in the range space 3?{p T ®I p ). Thus, for the 
objective A2 to be feasible, the target sets jz4 must satisfy 

{M x---x^}n^(£) r ®/ p )^0. (2.7) 
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Fig. 2.1 Step 1 transforms agent dynamics from (2.9) to (2.10) by designing an internal feedback 
T,. This internal feedback achieves for agent i passivity from an external feedback signal W; to the 
velocity error >',-. The resulting passive block is denoted by J%. 



2.3 The Passivity-based Design Procedure 

Step 1. Design an internal feedback loop for each agent i = 1, • • • ,]V that renders its 
dynamics passive from an external feedback signal m,- (to be designed in Step 2) to 
the velocity error 

yi :=xi-v{t). (2.8) 



Assume that the input-output dynamics of agent i are given by 



(2.9) 



where ^"{t,} denotes the output of a dynamic system Jf { with the control input 
Xi. The system J^" may be linear (e.g., single/double integrators) or nonlinear (e.g., 
Euler-Lagrange equation). In Step 1, we seek a feedback controller T; for each agent 
such that the agent dynamics J%° in (2.9) may be expressed as 



ii =J%{ui}+v(t), 



(2.10) 



where M'i is as in Fig. 2.1. For example, for the first order agent dynamics i, = T,, 
Step 1 is trivially accomplished by choosing Tj = Jfi{ui} + v(t). 
If J$?i is dynamic, we assume that it is of the form 



' ' ' \yi = hi(£i,ui) 



(2.11) 



where y ; - is the velocity error and ^,- £ K" 1 is the state variable of subsystem J^. We 
assume that /;(•, •) and h{(-,-) are C 2 functions such that 



and 



fi(0,Ui)= 0=^ = 



/!;(0,0) = 0. 



(2.12) 



(2.13) 



The main restriction on (2.11) is that it be strictly passive with C , positive defi- 
nite, radially unbounded storage functions Sj(^i) satisfying (1.27) for some positive 
definite functions W;(-). 
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If Ji^i is a static block, we restrict it to be of the form 

yi = hi{ui) (2.14) 

where hi : W — > W is a locally Lipschitz function satisfying (1.25) for any u ^ 0. 
In the situation where one of the agents, say agent 1, is the "leader" of the group in 
the sense that x\ uses no feedback term from the other agents, we let 

fci(wi) = VmiGK p . (2.15) 

We next show how to apply Step 1 to agents modeled as double integrators. In 
Chapters 5 and 6, we will demonstrate that broader classes of physical systems, 
including rigid body rotation and Euler-Lagrange systems in (1.34), may be trans- 
formed to the form in Step 1. 

Example 2.1 (Step 1 for agents modeled as double integrators). 
We consider double integrator agent dynamics 

miXi = Ti, i=l,---,N (2.16) 

where m,- is the mass of agent i, Xj e V denotes the position of agent i and T; € W 
is the force input of agent i. For planar agents, p = 2 and for spatial agents, p = 3. 
According to Step 1, we design an internal feedback 

Tj = -ki(J(i - v(t))+nijv(t) + u { , k > (2.17) 

which makes use of information available only to agent i itself. This feedback law, 
together with the change of variables 



brings (2.16) to the form 



$i=x t -v{t), (2.18) 

Xi= yi + v{t) (2.19) 

,m ■. { m & = r**6 + Ui (2.20) 

I yt = &• 

Note that Jfi is first order because we effectively consider E,, as the state variable 
instead of x,. The transfer matrix of (2.20) from m,- to yi is 

Hi(s) = — l —- I p , k t >0, (2.21) 

miS + kj 

which is strictly positive real as shown in Example 1.4. Thus, Jtf is strictly passive 
due to Lemma 1.2. Indeed, a valid storage function for (2.20) is given by 

$(£) - \m$$i. (2-22) 
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It is easy to examine that the assumptions in (2.12) and (2.13) are satisfied in (2.20). 
Thus, Step 1 is completed with the control law in (2. 17). Note that other higher order 
control laws can be designed to render J^f strictly passive. □ 

Step 2. Design an external feedback signal ut of the form 

I 
u, = -Y, d 'k¥k(z k ) (2.23) 

fc=i 

where Zk 's are the relative variables as in (2.2), and the multivariable nonlinearities 
y/j. : R p — > M. p are to be designed such that the target sets s$k are invariant and 
asymptotically stable. 

The external feedback law (2.23) is decentralized and implementable with avail- 
able information since d^ ^ only when link k is connected to node ;'. 

Before specifying the properties of %, we note from Fig. 2.1 and (2.23) that the 
interconnection of ^f s and %'s is as in Fig. 2.2, where 



«=[«[,■•., «£] r €R** v=[tf,-,y[] T €W* y=\y T l ,--- ) y T N ] T ^ pN . 

(2.24) 
Note from (2.23) that 

"i = 

which means 



d n I p \---\d u I p ]y/ i (2.25) 



u = -(D®I p )\ir(z). (2.26) 

Fig. 2.2 exhibits a "symmetric" interconnection structure similar to Structure 4 
in Section 1.5. The symmetric interconnection follows from the symmetry inherent 
in the undirected graphs. This structure allows us to proceed with a passivity-based 
design of 1//*, k = 1 , • • • , I. 

We design the nonlinearities \jfk(zk) to be of the form 

Yk(z k ) = VPk{z k ) (2.27) 

where Pk(zk) is a nonnegative C 2 function 

Pk : % -> M>o (2.28) 

defined on an open set ?i C l p , where Zk is allowed to evolve. As an illustration, if 
xt's are positions of point masses that must maintain a prescribed distance, then the 
choice < $k. — {zk I Zk S R p \ 0} disallows the possibility of collisions between linked 
agents. 

To steer Zk's into the target sets &$k c 54, we let Pk(zk) and its gradient VPk(zk) 
vanish on s^k, and let Pk(zk) grow unbounded as Zk goes to the boundary of Sfj: 

Pk(zk)^™asz k ^d% (2.29) 

Pk{zk)=0^z k e^ k (2.30) 

VPk(zk)=0&Z k €^ k . (2.31) 



24 
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— ^ >D T ®I 




Fig. 2.2 The closed-loop structure of (2.8), (2.11) and (2.26): J£fs are designed to be strictly 
passive while pre- and post-multiplying D T and D preserves the passivity from z to \jf. The closed- 
loop stability follows from the interconnection of two passive systems. 



When 54 = MP, (2.29) means that Pk(zk) is radially unbounded. As shown in [137, 
Remark 2], a continuous function Pk(zk) satisfying (2.29) and (2.30) exists for any 
given open set §0^ and compact subset s^\ C S^. We further assume that the sets ^4 
and 05fc are chosen such that C 2 smoothness of Pjc(') and (2.31) are also achievable. 
For example, if two agents need to reach a common value, we let s^ k = {0} and 
% = W. Then the choice of P k (z k ) = \\z k \ 2 satisfies (2.28)-(2.31). If two agents 
must maintain a relative distance of 1, we may choose stf k — {z k \ \z k \ = 1} and 
&k = {zk I %k G R p \ 0}. In this case, a valid choice of P k is given by P k (zk) = |z*| — 

lnW-1- 

The construction of \ff k as in (2.27) is designed to render the system from ik to 

y/k (and hence from z to \ff due to the block diagonal structure in Fig. 2.2) passive. 

Indeed, consider P k as a storage function and note that 



Pk = Wk{zk) T z k , 



(2.32) 



which shows the passivity property. 



2.4 Stability Results 



From Fig. 2.2, the set of equilibria is given by 

S = {(z,§)| % = 0, (D®I p )y{z) = and z e ,^(D 7 ' ®/ p )} 



(2.33) 



which means that the following property must hold true to ensure that no equilibria 
arises outside the sets s^\\ 

Property 2.1. For any (z,0) € S, i.e., | = 0, (D® J p )v(z) = and z e M(D T (gtI p ), 
z satisfies z G M x • • • x ^. D 

In view of (2.26), Property 2.1 means that 
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u = ^=> z = {D T ®l p )x e M x . . . x j^. (2.34) 

When the graph has no cycles, that is, when the columns of D are linearly inde- 
pendent, then (D®Ip)\j/(z) — implies y/(z) — and Zk G ^4 follows from (2.31). 
Thus, Property 2.1 holds for acyclic graphs. When the columns of D are linearly 
dependent, whether Property 2.1 holds depends on the sets ^4 and iffk- As we will 
illustrate, it holds in agreement problems where s^k is me origin but fails in the 
formation control problem with distance only criterion where £?% is a sphere. 

The feedback interconnection shown in Fig. 2.2 is of the same form as Structure 

4 in Section 1.5. The storage functions for the feedforward and feedback subsystems 
of Fig. 2.2 are, respectively, 

t 

where J? denotes the subset of indices i= l,---,N that correspond to dynamic 
blocks J#i, In particular, the passivity of the feedforward subsystems follows from 
Structure 1 in Section 1.5. Using the passivity of the feedforward and feedback 
subsystems and Structure 4 in Section 1.5 and imposing Property 2.1, we prove 
asymptotic stability of the set of points where | = and Zk G ^4 by taking as a 
Lyapunov function the sum of the two storage functions in (2.35). This construction 
results in a Lur'e-type Lyapunov function because its key ingredient f%(zk) is the 
integral of the feedback nonlinearity yfk(zk) = ^Pk(zk)- We summarize the main 
stability result in the following theorem. 

Theorem 2.1. Consider the closed-loop system (2.8), (2.11) and (2.23), where v(t) 

is uniformly bounded and piecewise continuous and J#t, i = 1 , • • • , N, and \j/k> k = 

1, • • • ,£ are designed as in (2.11)-(2.15) and (2.27)-(2.31) for given open sets 5fj C 

R^ and compact subsets sik C Sft, where &/% are as in (2. 7). Then: 

i) The feedforward path in Fig. 2.2 is passive from x to —u, and from y to u; 

ii) The feedback path is passive from input u toy; 

Hi) When Property 2. 1 holds, the set 

£^ = {(z,%) |§=0,ze^i x---x£/ e n^(D T ®l p )} (2.36) 

is uniformly asymptotically stable with region of attraction 

^ = {(z^)|^l"' x-xl"",ze^ix.-x^n*(fl r ®/ p )}. (2.37) 

Moreover, all trajectories {z(t),^(t)) starting in <£ converge to the set of equilibria 

5 in (2.33). O 

When Property 2.1 fails, Theorem 2.1 proves that all trajectories converge to the 
set of equilibria & in (2.33). In this case, it is possible to conclude "generic conver- 
gence" to stf from almost all initial conditions if one can show that the equilibria 
outside of si are unstable. We will illustrate such an example in Section 2.7. 
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Convergence to si means that the difference variables Zk tend to the target sets 
srfk- It also implies that t, = 0, u = and thus, y in (2.8) is zero, which means that 
both objectives Al and A2 are indeed achieved. 



Proof (Proof of Theorem 2.1). 

i) To prove passivity from x to — u we use VAz) in (2.35) as a storage function, 
and obtain from (2.27), (2.6), (2.26) and (D®I p ) T = D T '®I p : 

V f = y/ T z = y/ T (D T (E)I p )x={{D(E)I p )\i/} T x = -u T x. (2.38) 

To show passivity from y to — u we substitute x = Ijv ® v(f) +y in (2.38) and use the 
fact (D T (8)I p )(In<& v(t)) = from the third item in Property 1.5, thus obtaining 

V f = y T (D T ®I p ){\ N ®v(t)+y} = \f(D T ®I p )y 

= {{D®I p )y} T y=-u T y. (2.39) 

ii) To establish passivity of the feedback path, we let Jt denote the subset of indices 
i = 1, • • • ,N for which Jtf is a dynamic block as in (2.11), and employ the storage 
function V&(|) in (2.35), which yields: 

?»=£&<£ (~W(&) + wfyj)- ( 2 - 40 ) 

Adding to the right-hand side of (2.40) 

E "fyi > ° ( 2 - 41 ) 

which is nonnegative because the static blocks satisfy (1.25) or (2.15), we get 

% < £ (-Wi(^) + uj yi ) + £ «f y, < - £ Wi(§») + « r y (2.42) 

and, thus, conclude passivity with input u and output y. 

iii) To prove asymptotic stability of the set &/ we use the Lyapunov function 

V(z,§) = V / (z)+V 6 (§) (2.43) 

which is zero on the set sd due to property (2.30), and grows unbounded as (z, % ) ap- 
proaches d^°° due to property (2.29). From (2.39), (2.40) and (2.41), this Lyapunov 
function yields the negative semidefinite derivative 

V < - £ Wm - £ "hi, (2-44) 

which implies that the trajectories (z(t)^(t)) are bounded on t £ [0,T], for any T 
within the maximal interval of definition [0,?/) for the differential equations (2.8), 
(2. 11). Because this bound does not depend on T, and because v(t) is bounded, from 
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(2.8) we can find a bound on x(t) that grows linearly in T . This proves that there is 
no finite escape time because, if 1 / were finite, by letting T — * f /■ we would conclude 
that jc(f/) exists, which is a contradiction. 

Having proven the existence of solutions for all t > 0, we conclude from (2.44) 
stability of the set si ' . However, because the right-hand side of (2.44) vanishes on 
a superset of si ', to prove attractivity of si we appeal to the Invariance Principle 1 
reviewed in Appendix B.2. To investigate the largest invariant set where V(z, §) = 
we note from (2.12) that if <!;,■ = holds identically then Uj = 0. Likewise the 
static blocks satisfy (1.25) or (2.15), which means that the right-hand side of (2.44) 
vanishes when u, ■ = 0, i = 1, • • • ,N. Indeed, if the first member i = 1 satisfies (1.25), 
then u\ = follows directly. If it satisfies (2.15) instead of (1.25), u\ — still holds 
because the sum of the rows of D being zero implies, from (2.6), that 

N 
«i=-£m,=0. (2.45) 

i=2 

We thus conclude that u — 0, which means from (2.26) that ljf(z) lies in the null 
space JV{D®I P ). Using the Invariance Principle, which states that all bounded 
solutions approach their positive limit set, which is invariant, we conclude that the 
trajectories converge to the set S in (2.33). When Property 2.1 holds, S coincides 
with si, which proves asymptotic stability of si with region of attraction §f , while 
uniformity of asymptotic stability follows from the time-invariance of the (z, §)- 
dynamics. □ 

The Lyapunov function V(z, §) in the proof above yields a negative semidefi- 
nite derivative. By using the observability condition in (2.12), we prove the stability 
results in Theorem 2.1. This Lyapunov function allows us to develop different adap- 
tive schemes to enhance robustness of group motion. For example, in Chapters 3 
and 4, we develop adaptive schemes that enable agents to estimate leader's mis- 
sion plan v(t). These adaptive schemes relax the assumption in Theorem 2.1 that all 
the agents must have the v(t) information. In Chapter 6, where agreement of multi- 
ple Euler-Lagrange systems is studied, we attempt to design adaptive control laws 
from V(z, £) to compensate for uncertainties in Euler-Lagrange systems. However, 
we illustrate with an example that the resulting adaptive design is not sufficient to 
ensure group objectives. This is because V is only negative semidefinite. We will de- 
tail in Chapter 6 how we overcome this insufficiency by exploiting the structure of 
Euler-Lagrange equations and the design flexibility offered by the passivity-based 
framework. 



1 The Invariance Principle is indeed applicable because the dynamics of (z, §) are autonomous: 
Although v(f) appears in the block diagram in Fig. 2.2, it is canceled in the i equation because 

{D T ®I p ){\ N ®v{t))=0. 
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2.5 Application to the Agreement Problem 

In several cooperative tasks, it is of interest to steer group variables, such as position, 
heading, phase of oscillators, to a common value. To apply Theorem 2.1 to this 
problem, we let xi E R p denote a vector of variables of interest, and select the target 
sets to be jz4 = {0}. With this choice of g/ k , the target set constraint (2.7) is trivially 
satisfied. We may choose P k (z k ) as a positive definite, radially unbounded function 
on SJJfc = M p with the property 

z T k VP k (z k )=z T k Wk(z k )>0 Vz^O (2.46) 

so that (2.27)-(2.31) and Property 2.1 hold. In particular, Property 2.1 holds because 
z € ,^%{D T <g)I p ) and yr(z) e jV{D®I p ) imply that z and \f/(z) are orthogonal to each 
other, which, in view of (2.46), is possible only if z = 0. 

Corollary 2.1. Consider agents i = l,---,N, interconnected as described by the 
graph representation (1.21), and let z k , k — !,■■■,£ denote the differences between 
the variables x\ of neighboring members as in (2.2). Let P k (z k ) be positive definite, 
radially unbounded functions satisfying (2.46) and let y k (z k ) = VP k (z k ). Then the 
agreement protocol 

xi = <^i\- 1 td ik y k (z k )\+v(t), i=l,---,N (2.47) 

where Jifj{ut} denotes the output at time t of a static or dynamic block satisfying 
(2.11)-(2.15), guarantees |i,- — v(f)| — ► and 

xt — Xj — > as t — ► °° (2.48) 

for every pair of nodes (i, j) which are connected by a path. □ 

When p = 1, that is when jc,'s and Zjt's are scalars, condition (2.46) means that 
Vk(z k ) = VPk(Zk) is a sector nonlinearity which lies in the first and third quadrants. 
Corollary 2.1 thus encompasses the result of [102], which proposed agreement pro- 
tocols of the form 

*/ = - E fojfa-xj) ( 2 - 49 > 

where 0y(-) = ^yj(') P ia y s the role of our %(•). However, both [102] and a related 
result in [122] assume that the nonlinearities 0y(-) satisfy an incremental sector 
assumption which is more restrictive than the sector condition (2.46) of Corollary 
2.1, An independent study in [147] takes a similar approach to synchronization as 
[122]; however, it further restricts the coupling terms </»,■_,-(• ) to be linear. The feed- 
back law (2.47) in Corollary 2.1 generalizes (2.49) by applying to its right-hand 
side the additional operation Jfi{-}, which may either represent a passive filter or 
another sector nonlinearity /!,(•) as specified in Section 2.3. Because Jtf in (2.47) 
can be dynamic, Corollary 2. 1 is applicable, unlike other agreement results surveyed 
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in [1 10], to plants with higher-order dynamics than an integrator. See, for example, 
the second order system in Section 2.6.1. 



2.6 Position-based Formation Control As a Shifted Agreement 
Problem 

One of the major topics in cooperative control is the formation maintenance and 
stability, where the goal is to drive relative positions (i.e., z^'s) or relative distances 
(i.e., \zk\'s) between agents to prescribed values. Depending on the goal, we may 
pursue one of the following: 

• distance-based formation control, where the desired target set ^4 in objective A2 
is given by 

** = {** I M =«**}, d k £R >0 ,k=l,---,£; (2.50) 

• position-based formation control, where the desired target set jz4 in objective A2 
is given by 

^k = {zk\z k = 4}, 4<=RP,k=l,---,l (2.51) 

The goal of the distance-based formation control is to achieve a desired shape of 
the group formation while the position-based formation control concerns not only 
the desired shape but also the desired orientation of the group formation. We first 
consider the position-based formation control and demonstrate that it can be trans- 
formed to an agreement problem. 

The set points zf in (2.51) dictate the relative configuration of the group. When 
the graph contains cycles, the sum of the relative position vectors zj over each cycle 
must be zero; that is, z = [zf , ■ ■ ■ , z\ ] T must lie in the range space of D T g) I p so that 
(2.7) holds. We thus assume that z d — [{z c [) T , • • ■ , (zf) T ] T is designed to lie in the 
range space of D T <g>I p , which means that 

z d = {D T ®Ip)x c (2.52) 

for some x c £ W . The condition (2.52) implies that (2.7) is satisfied. 
Introducing 

x(t):=x(t)—x c — 1jv®v(tWt, 

Jo 

where x c is as in (2.52), and 



(2.53) 



z = (D T ®I p )x = z-z, (2.54) 

we notice that objectives A1-A2 for the position-based formation control translate 
to the asymptotic stability of the origin for 

X=[x T z T ] T . (2.55) 
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According to Corollary 2.1, the global asymptotic stability of X — is guaranteed 
by the protocol 

*i = J%\-Y,d ik y k (z k )\, i=h---,N (2.56) 

where %(•) satisfies (2.46). Using (2.23), (2.53) and (2.54), we rewrite (2.56) in the 
original coordinate (i, z) as 

Xi = yi + v(t) (2.57) 

yi = J%{u,} (2.58) 

where 

e 
u, = -Y,d lk Yk(z k -z d k ). (2.59) 

Corollary 2.2. Consider a group of agents i— 1 , • • • , N. The protocol (2.57)-(2.58), 
where !//# = ^Pk{zk) satisfies (2.46), guarantees that 

|i/-v(0|->0, Vi, (2.60) 

z*^4, V*. (2.61) 

n 

Example 2.2 (Collision avoidance). 

The closed-loop system (2.57)-(2.58) ensures only the convergence to the desired 
formation. Other objectives, such as collision avoidance, can be achieved by incor- 
porating additional feedback terms. For example, to avoid collision, we employ the 
artificial potential field approach in robotics and augment m in (2.58) as 

e n 

Ui = - £ diWkizh - 4) - £ V Xi Qij(\xi -xj\) (2.62) 

fc=l 7=1 

where the C 1 artificial potential function &/(•) : M>o — ► R>o satisfies 

Q tJ (s)^°° as s^O (2.63) 

Qij{s) = as s># (2.64) 

for some positive R. Using the Lyapunov function 

i=l k=l 1=1. />/ 

we obtain 
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N N I N \ 



v<-E^-(«)-E >f E v v , fl; -(|x,--x ; -i 

'•=1 '=1 V /=!,#< 



E E ((V,,G,;(|x, ~^|)) r i,- + (V^Gyfl* -*yl)) V) ■ ( 2 - 66 > 



;V 

E 

i'=l J>i 
Since VxjQyfljj — Jt/|) = — V x .g; ; -(|x,- — x/|), we rewrite V using (2.57) as 

v<-tm)-t t yf(v,,a 7 (|x,--x ; -D) 

+ E E ( (V*Qy(N -^|)) r yi + (V.v,a./(|x, -x ; |)) 7 yj ] 

= -t WW- 1 f yf (^QijQxi-xji)) + tl,FxjQij(\*-xj\)) T yj 

1=1 i=\ j<i i=l j>i 

N 

= -E^(^)<0. (2.67) 

(=1 

Thus, V in (2.65) is nonincreasing, that is, V(t) < V(0). Since V — > °° as |x; — jc/| — > 
0, V; 7^ j, the boundedness of V (t) implies collision avoidance. 

Applying the Invariance Principle, we conclude from (2.67) that t, — > 0, which 
implies from (2.12) that u — > 0. Note that due to the additional term in (2.62) that 
handles collision avoidance, m, — ► does not mean Zk — * zf, that is, convergence to 
the desired formation is not guaranteed. Indeed, there may exist an asymptotically 
stable equilibrium where w, = and the desired formation is not achieved. This 
equilibrium corresponds to a local minimum of the potential function V in (2.65). 
To eliminate such a local minima, one may apply navigation function techniques in 
[113] to the construction of fy and Qij such that from almost all initial conditions, 
the agents converge to the desired formation. We refer interested readers to [133, 
132] for further details on applying navigation function to formation control. □ 



2.6.1 Design Example 

We consider a group of agents modeled as (2.16) in Example 2.1. The feedback law 

(2.17) achieves Step 1. We next apply Step 2 and design U{. According to Corollary 

2.2, we take 

I 

Ui = -Y,d ik \l/k(zk-z(), (2.68) 

i=l 

where y/fc(-) satisfies (2.46). The closed-loop system of (2.19), (2.20) and (2.59) is 
given by 
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I 

mi(xi - v{t))+ki{xi - v{t)) + £ dftYfrfo - 4) = ° ( 2 - 69 ) 

which can be rewritten as 

(M®I p )x+(K®Ip)x+(D®I p )\ir(z) = (2.70) 

where M = diagjmi , • • • ,m7v} and 7T = diagj&i , • • • ,k^}. 

We now show that for quadratic potential function P k , (2.70) recovers a second 
order linear consensus protocol. To this end, we choose a quadratic potential func- 
tion 

Pk = ~\z k -4\ 2 S*eR>o (2.71) 

which leads to 

Wk(zk) = 8 k (z k -z d k ). (2.72) 

The constants S^'s are the feedback gains which regulate the relative emphasis of 
\Zk — zf\ for different k 7 s. Defining 

A=dmg{8 h ---,S i } (2.73) 

and substituting (2.54) in (2.70), we obtain 

(M®I p )x+(K®I p )x+(L A <g>/ p )x = (2.74) 

where L A — DAD T is the weighted graph Laplacian (recall that without the subscript 
"4", L denotes the unweighted Laplacian L — DD T ). The closed-loop system (2.7 4) 
is a second order linear consensus protocol well studied in the literature (see e.g., 
[109]). The design in (2.70) gives a passivity interpretation of the second order 
consensus protocol (2.74) and extends it to nonlinear coupling %. 

Example 2.3 (Agreement of Second-order Agents with Directed Graphs). 

When the graph is undirected, the stability of (2.74) holds for arbitrary m, > 
and ki > 0, V7. For directed graphs, however, {2.1 A) may become unstable even for 
uniform mi and kj. To illustrate this, let us take p = 1 (scalar variables), m, ■ = 1, 
hi = 1, and S k =l, V/, V£, in (2.74), which leads to 

x + x + Lx = (2.75) 

where Lis defined in ( 1 . 1 1 ) . 

To investigate the stability of (2.75), we use the Schur decomposition reviewed 
in Appendix B.l and decompose L as 

L = QBQ l (2.76) 

where Q is a unitary complex matrix and B is an upper triangular matrix with all the 
eigenvalues of L on the diagonal of B. Note that if L is symmetric, i.e., the graph G 
is undirected, Q can chosen as the orthonormal eigenvectors of L and accordingly B 
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Fig. 2.3 Nyquist plot of — , ' j , . Four ®'s denote the inverse of four nonzero eigenvalues of the 
graph Laplacian matrix when the information topology is a directed cycle of 5 agents. 



is a diagonal matrix. We will use this decomposition technique again in Chapter 9 
to study robustness properties of (2.74) with undirected graphs. 
Using a coordinate transformation 



we obtain from (2.75) 



d = Q- 



d + d + Bd = 0, i=l, 



,N. 



(2.77) 



(2.78) 



Because B is upper triangular and because the eigenvalues of L are the diagonal 
elements of B, the stability of (2.78) is equivalent to the stability of 



d + d + lid = 0, i=l,---,N, 



(2.79) 



where A,- is the ;th eigenvalue of L. If A; = for some i, (2.79) is stable. It then 
follows that (2.79) (and thus (2.75)) is stable if and only if the Nyquist plot of 
— -7 ' rr does not encircle A ; for any nonzero A,-. 

The Nyquist plot of — -r+n ^ s snowri by the solid-dash line in Fig. 2.3. For undi- 
rected graphs, A, is nonnegative and the Nyquist plot never encircles nonnegative 
real axis. Therefore, the stability of (2.75) is guaranteed independently of the graph 
and the number of the agents. However, for directed graphs, A, may become com- 
plex and thus the graph and the number of agents may affect stability. For example, 
consider a directed cyclic graph of N agents, where agent (' is the only neighbor of 
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agent 1( 




agent 2 Qj K^J agent 3 



Fig. 2.4 We assign an orientation to an undirected graph of three agents, where every two agents 
are neighbors. The link number is put beside each link. 



agent i + 1, i = 1 , • • • , N — 1, and agent N is the only neighbor of agent 1 . For N < 5 , 
(2.75) is stable. However, for N — 5, there exists two A, such that Kf is encircled 
by the Nyquist plot of — , ' t > , as shown in Fig. 2.3. Thus, (2.75) becomes unstable, 
which implies that for directed graphs, closed-loop stability is sensitive to the graph 
structure and to the number of agents. □ 



2.6.2 A Simulation Example 

In this section, we simulate the position-based formation control system (2.74) and 
demonstrate that different orientations of the formation can be achieved by modify- 
ing ^4's. 

We consider a group of three planar agents (i.e., p — 2), where any two agents 
are neighbors. As shown in Fig. 2.4, we define 

Z1=X2—X1, Z2=X3-X 2 , Z3=Xi-X 3 (2.80) 

and design desired target sets for Zk's to be 

^2 = {Z2\Z2=4=[° -!] r }. (2- 81 > 

M = {z3| Q =4' = [f \Y}. 

We choose M = diag{5,2, 1} and K = 5/3 in (2.74). The reference velocity v(t) 
is zero. The weight A in (2.73) is set to It,. The initial positions of the agents are 
xi (0) = [5 0} T , x 2 (0) = [2 2] r , and x 3 (0) = [0 0} T . Simulation result in Fig. 2.5 
shows that the desired formation is achieved. 

We now modify the desired target sets in (2.81) to 

M = {zi|zi=*? = [-i -% T }, 

^2 = {Z2\Z2=4 = [\ 0} T }, (2.82) 

^ = \z3\z3=4 = H f V}- 
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Fig. 2.5 Using (2.70) and (2.72), three agents converge to the desired formation specified in (2.8 1). 
Agents 1, 2 and 3 are denoted *, o, and o, respectively. 



As shown in Fig. 2.6, (2.82) corresponds to the desired formation in Fig. 2.5 rotated 
counterclockwise by 90 degrees. 
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Fig. 2.6 Using (2.70) and (2.72), three agents converge to the desired formation specified in (2.82). 
Agents 1, 2 and 3 are denoted *, o, and o, respectively. 



We see that the position-based formation control stabilizes both the shape and the 
orientation of the group formation. If the shape of the group formation is the only 
concern, the distance-based formation control studied in the next section is more 
appropriate. 
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2.7 Distance-based Formation Control 

In this section, we study the distance-based formation control problem defined in 
(2.50). In contrast to the position-based formation control, this problem concerns 
only the shape of the group formation. It will become clear that a key complication 
is that Property 2.1, which holds true in the position-based formation control, is no 
longer satisfied in the distance-based formation control with cyclic graphs. Thus, 
additional undesired equilibria may arise due to the cycles in the graph, making 
global stabilization of the desired formation impossible. In the special case of three 
agents, we show that the undesired equilibria are unstable. We then conclude generic 
convergence to the desired formation from almost all initial conditions. 

We also explore existence and uniqueness of the formation shape in this section. 
The existence of a formation shape is related to the requirement in (2.7), which is 
further sharpened to sufficient conditions on the desired target sets. These conditions 
are generalizations of the triangle inequality. We explore the uniqueness issue using 
a four-agent example. If the shape of the desired formation is a rectangle, specifying 
desired relative distances of the four sides is not sufficient since the agents may 
reach a parallelogram instead. In this case, desired relative distances of the diagonal 
links must also be specified to ensure that the rectangle shape is the unique desired 
formation. 



2. 7. 1 Passivity-based Design 

We assume that Step 1 of the passivity-based design has been achieved. We now 
proceed to Step 2 and design the nonlinearities l//fc's. The control objective is to 
stabilize a formation where the relative distances |zj|, k= 1,- •• ,£, are equal to dk > 
0. We choose ^ to be M. p \ {0} and let the potential functions Pk be a function of Zk 
satisfying (2.27)-(2.31). An example of Pk(zk) is given by 

rte\ 
Pk{zk) = / o k { S )ds (2.83) 

Jd k 

where Ok : R>o — > R is a C 1 , strictly increasing function such that 

o k (d k )=0, (2.84) 

and such that, as \zk\ ~^ and as \zk\ — > °°, Pk(zk) -^ °° in (2.83). An illustration of 
Pk(zk) is shown in Fig. 2.7, where 

o k (s) = ^---, (2.85) 



dk s ' 

L ln M 

d k d k 



P k (z k ) = ^-^-l (2-86) 
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Pk(zt) 




Fig. 2.7 The shape of Pk{zk). The minima of Pk(zk) = |z*| — hi|zfc| — 1 occur on the unit circle 
\zk\ = (-h = \. The peak at the origin guarantees the collision avoidance between the linked agents. 



and d\ is set to 1. Note that the condition Pk{zk) -^ °° as |z^| — ► is imposed only 
to ensure collision avoidance between linked agents. Since I\(zk) satisfies (2.27)- 
(2.31), the feedback law m, in (2.23) with the interaction forces 



Wk{zk)^^Pk{zk)^o k {\z k \) r - i z k z k ^0 



(2.87) 



guarantees global asymptotic stability of the desired formation from Theorem 2. 1 
when the graph G is acyclic. 

For cyclic graphs, we need to examine whether or not Property 2.1 is satisfied. 
We consider an example of three agents, where each agent is a neighbor of the other 
two agents, thereby forming a cycle in the graph G. Let the desired formation be 
an equilateral triangle shown in 2.8(a) with <4 = 1, k = 1,2,3. Note that \ffk(zk) 
in (2.87) plays the role of a "spring force" which creates an attraction force when 
\zk\ > 1 and a repulsion force when \zk\ < 1. When m, = 0, additional equilibria arise 
when the point masses are aligned as in Fig. 2.8(b), and the attraction force between 
the two distant masses counterbalances the repulsion force due to the intermediate 
mass. 

To characterize such equilibria, we let the middle agent in Fig. 2.8(b) be agent 2 
and define 

Zi=xi—X2 Z2=X2-xs and zi=x\— X3, (2.88) 

which implies from (2.6) that 
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(a) (b) 

Fig. 2.8 The desired and the undesired formation of three agents with a cyclic graph: The desired 
formation is the equilateral triangle as (a) and the undesired formation (b) is a line. 




D = 

The set of equilibria, given in (2.33), indicates that 

V(z) = [Vi(zi) 7 \V2(z 2 ) J > 3 (z3) T f e.yV(D®I p ). 
A simple computation of the null space of D yields 



ViUiJ 

Vi(zi) 



Vl{Z2) 

-V3(Z3) 



(2.89) 



(2.90) 



(2.91) 
(2.92) 



Since the undesired formation in Fig. 2.8(b) is collinear, A's are the same. We 



then use (2.87) to reduce (2.91) and (2.92) to 

°i(|zi|) = -°s(M) 
Oi(|zi|) = <%(W), 



(2.93) 
(2.94) 



> 2 . 



S3 



■Z2\ 



which have a unique solution (|zi| = Sj, \z2\ 
°it(')> ^ ~ 1 : 2, 3 are increasing and onto. Thus, the set of points where |zi| = s\, 
\z2\ = *2 anc ' l z 3l = - s i +- s 2 constitute new equilibria as in Figure 2.8(b) and such 
desired cannot be eliminated with the choice of the function Ojt(-) . Property 2.1 
then fails in this formation control design and global stabilization of the desired 
formation is not possible for cyclic graphs. 

For agents modeled as double integrators with uniform mass and damping, the 
following example proves that the undesired equilibria in Fig. 2.8(b) are unstable. 
In fact, this instability result can be extended to any graph that contains only one 
cycle. We refer interested readers to [9] for details. 

Example 2.4 (Instability of the undesired formation of three agents). 

Consider the undesired formation in Fig. 2.8(b). We first find out s\ and s\ from 
(2.93) and (2.94). We take ojt(-), k= 1,2,3, as in (2.85) with 4 = 1. It follows from 
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(2.93) and (2.94) that 



"(1- 



(2.95) 



and 



■■s 2 , 



(2.96) 



I -"2 I 



which yield ,s* = s 2 = f. This means that on the undesired formation, \z\ 
and |z 3 | = |. 

We assume that the agents have uniform mass m; = 1 in (2.16) and uniform 
damping A:,- = k > in (2.17). Without loss of generality, we also let v(t) = 0. It then 
follows from (2.16), (2.17), (2.26) and (2.89) that the closed-loop system for these 
three agents is given by 



x\ +kx\ + V(zi) + Vfe) = 
X2 + kx 2 - y/(zi ) + y/(z 2 ) = 
xi+kx 3 - y/-(z 2 ) - Vfe) = 

where 1/a(z#) is obtained from (2.87) and (2.85) as 



(2.97) 
(2.98) 
(2.99) 



Y(zk) 



Zk 



z k 



-1 

\2~ Z k- 



(2.100) 



To show the instability of the undesired formation, we linearize the closed-loop 
system around the undesired formation xi = 0, i = 1,2,3 and Zk = Z?, k = 1,2,3, 
where z u k denotes an undesired equilibrium of Zk- Letting 8zk — Zk~ zt, k = 1,2,3, 
we obtain the linearized dynamics: 
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(2.101) 



An 


A u \ 


X2 

i 3 
8z\ 

8Z2 

[SziJ 


A 2 i 


a 22 ; 



(2.102) 



where 



~dz~\z" 



\z u \-l r 

\,u\2 L V 



,i/|3 



z u (z u ) 



(2.103) 
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To show that the undesired formation is unstable, we only need to demonstrate 
that A has an eigenvalue with positive real part. Towards this end, we solve 



Mi 



\ A 21 



M2 



A22 



for A , and obtain 



-&/*iH-A 12 M2 = A,Ui 

A21JU1 = A;ll2- 

Multiplying (2.105) with A and substituting (2.106) leads to 
A 2 ^i+H^i-Ai 2 A 2 i^i =0. 



(2.104) 



(2.105) 
(2.106) 



(2.107) 



By choosing /ii as the eigenvectors of A12A21, we obtain the eigenvalues of A as 
the solutions to the following equations 



A 2 +JfcA-A, = 0, k>0, i=l,---,3p 



(2.108) 



where A,- is the ith eigenvalue of A12A21 . 
We next compute 



A12A21 = 



/- 


dy\ dy 1 


dy | 


dy\ \ 


dz Iz'/ dz 1^ 


dz \z\ 


dz \z" 3 




dy 1 


dy 1 dy 1 


dy 1 




dz \z\ 


dz \z![ dz \z\ 


dz \z" 2 


V 


dy 1 


dy 1 


dy \ dy \ 


dz \z\ 


dz \z\ 



(2.109) 



Note from (2.103) that -J^l u ,k= 1,2,3, are symmetric. Thus, A12A21 is also sym- 

OZ \Z k 

metric. Then if the matrix A12A21 has a positive eigenvalue, there exists a positive 
root of (2.108) and therefore A is unstable. 

To show that A12A21 has a positive eigenvalue, we recall that on the undesired 
formation, z^'s are collinear, which means that there exists a z € K p such that z -L z\, 
\/k. This implies from (2.103) that 



~dz~\ 



1 



*?■■■ 



(2.110) 



Choosing £ = [a b c] T ®z, where the scalars a,b,c will be specified later, and using 
(2.110), we obtain 



Ai2A 2 i£ : 
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14 
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(2.111) 
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Fig. 2.9 The unstable eigenvector of the undesired formation corresponds to motion in the direc- 
tion indicated by the bold arrows. 



We now substitute \z l ! 



\X\ 



and 



SI 



into (2.111) and get 



Ai2A 2 iC = 




(2.112) 



The matrix B has a positive eigenvalue X = I associated with an eigenvector 

[—1 2 — l] T . By choosing [a b c] T = [—12 — l] 7 ", we rewrite (2.112) as 



Ai2A 2 iC = AC, 



(2.113) 



which shows that A12A21 has a positive eigenvalue A. Therefore, the undesired for- 
mation in 2.8(b) is unstable. 

The unstable eigenvector [—1 2 — l] r <g>z corresponds to motion in the direction 
shown by the bold arrows in Fig. 2.9. We interpret the unstable growth in this di- 
rection by returning to the mass-spring analogy. Since \z"\ = |zj>| < 1 and |z"| > 1, 
springs 1 and 2 are squeezed while spring 3 is stretched. The motion in Fig. 2.9 
increases \z\ | and \zi\ towards their natural length of one. □ 

Because the undesired formation in Fig. 2.8(b) is unstable, we conclude generic 
convergence to the desired formation in Fig. 2.8(a) from all initial conditions except 
for those that lie on the stable manifolds of the unstable equilibria. The numerical 
example in Fig. 2.10 shows the convergence to the desired formation with the design 
(2.16) and (2.17) for three agents. In this example, the reference velocity v(t) is 
chosen as [0.1 0.1] r . 
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Fig. 2.10 Snapshots of the formation for the passivity-based design (2.16) and (2.17): The three 
relative positions, z\ , Zt and 23 denote x\ — X2, *3 — xi and x-i — X3. The agents x\ , xi and xt, are 
represented by *, o and o, respectively. 



2.7.2 Existence and Uniqueness of a Formation Shape 



A key consideration in the distance-based formation control problem is whether a 
given set of desired relative distances d^'s, even admits an equilibrium in the closed 
loop, and, if so, whether the equilibrium is unique. 



2.7 Distance-based Formation Control 



agent 2 



43 



agent 1 



agent 3 




agent N 



Q agent N — 1 



Fig. 2.11 The ring graph of JV agents. The directions of the N links are assigned such that the 
positive end of each link is the negative end of the next link in the sequence. 



Given d^'s, a desired formation equilibrium exists if the constraint (2.7) is satis- 
fied. If the graph is acyclic, (2.7) holds for any df, > 0, k = 1 ,■■■,£. So we only need 
to consider the cyclic graph case. 

As an example, consider a group of N agents that form a ring graph, i.e., each 
agent has exactly two neighbors. A ring graph has only one cycle and the numbers 
of links and nodes are the same, i.e., N — t. As shown in Fig. 2.11, we assign the 
orientation of the ring graph such that the positive end of each link is the negative 
end of the next link in the sequence. We define zi =^,+i — x,, V/ = 1, ■ ■ ■ ,7V 1 and 
Zn = x\ — xn , and obtain 

N 

zj = - L *. V i- ( 2 - 114 > 

This equality must be satisfied at the desired formation. Therefore, we obtain 



M = l t al< t h\, vj 

N 

=>dj< Y, d k, V./'. 

k=\,kH 



(2.115) 
(2.116) 



When TV = 3 and the desired formation is a triangle, (2.116) reduces to the triangle 
inequality, that is, the sum of the lengths of any two sides of the triangle must be 
greater than the length of the other side. Thus, the choice of <4 is constrained by 
(2. 1 16). If the graph contains multiple cycles, multiple constraints similar to (2. 116) 
must be satisfied for <4's so that a desired formation exists. 

Once we establish that a desired formation exists for a given set ofd^'s, the shape 
of the desired formation may not be unique if we do not specify enough number of 
desired relative distances. We illustrate this using a four-agent example below. More 
formal analysis on the uniqueness of a formation shape using "rigidity" can be found 
in [101, 46, 72, 43]. 



Example 2.5 (Stabilizing a rectangle formation of four agents). 
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Fig. 2.12 The square formation can collapse to a parallelogram or eventually to a line. 





(a) (b) 

Fig. 2.13 Two possible desired formations when one diagonal link is specified. 



Suppose that we want to stabilize four agents to a rectangle formation (solid 
lines in Fig. 2.12). Initially, we only specify the desired relative distances of the 
four sides. As illustrated in Fig. 2.12, the agents may converge to a rectangle or to 
a parallelogram or even to a line since all those shapes are in the target set (2.50). 
In fact, there exist infinitely many formations (up to rigid translation and rotation) 
in (2.50). Thus, specifying the lengths of four sides is not enough to guarantee the 
desired rectangle formation (up to rigid translation and rotation). 

We then add a diagonal link and specify its length. Then there exist only two 
possible formations (up to rigid translation and rotation) in Fig. 2.13. Thus, if the 
agents converge to the target set (2.50), they will converge to either of the two shapes 
in Fig. 2.13. 

If we also specify the length of the other diagonal link as shown in Fig. 2.14, we 
eliminate the existence of the formation in Fig. 2.13(b). In this case, if the agents 
converge to the desired equilibria, they converge to the desired rectangle formation. 

□ 



2.8 Distance-based or Position-based? 



We have seen two types of formation control, both of which can be designed with 
the passivity-based framework. We now present a comparison between these two 
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Fig. 2.14 If we specify the desired lengths of all the links, the desired formation is unique. 
x\ X\ 

Q 




(a) 




(b) 



Fig. 2.15 (a) Desired formation in terms of z£, k = 1,2,3. (b) Initial formation of the three agents. 
Even if the initial formation (b) is an equilateral triangle, agents 2 and 3 will still swap their 
positions to match the desired formation in (a). 



formulations and illustrate the situations under which one formulation is preferable 
to the other. 
• Equilibria. 

For the distance-based formation control, the desired equilibria in (2.50) are 
spheres while for the position-based formation control, the desired equilibrium in 
(2.51) is a single point. The difference in these equilibria sets reflects different con- 
trol objectives. When the agents need to maintain specific bearings and distances 
with respect to their neighbors, the position-based formation control is more suit- 
able. In the case where cooperative tasks only require the shape of the formation 
rather than a specific orientation of the formation, the distance-based formation 
control is preferable. This is because the position-based formation control may put 
stringent constraints on relative positions and sacrifice the flexibility of the group 
motion, as we illustrate below. 

Example 2.6. Consider a group of three agents xi £ R 2 , i = 1,2,3. Suppose that the 
desired formation is an equilateral triangle with side length 1. One way to achieve 
this desired formation is to apply the position-based formation control by specifying 
desired relative positions between agents. We let z\ = *2 — X\, z% = xt, — *2 an d 



</3i Ji 
2 J' Z 2 



[1 0] r and d = [\ - yV in the 



H = x-i — x\, and choose zf = [— \ 
frame of E, as in Figure 2.15(a). 

Let the initial formation of the three agents shown in Figure 2.15(b) be z\ (0) = zi, 
Z2(0) = — zf and 23(0) = zf, which means that it is already an equilateral triangle 
with side length 1 . However, this equilateral formation does not match the desired 
targets (2.51). Since (2.51) is globally attractive by the position-based formation 
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Fig. 2.16 Four planar agents in a global frame E: fi, represents a local frame for agent ;'. The 
desired formation is shown as an equilateral square. 



control design, the agents will start moving away from the initial formation in Fig- 
ure 2.15(b) towards the desired formation in Figure 2.15(a), which results in unnec- 
essary time and control energy consumption. □ 

• Control Design and Stability. 

The difference in equilibria leads to different control designs: The design in 
Section 2.7.1 employs nonlinear potential functions to achieve the distance-based 
formation control while position-based formation control can be realized by linear 
feedback laws, such as the design in (2.74). 

Moreover, because of the difference in equilibria sets, the distance-based for- 
mation control stabilizes the desired formation only locally when the graph con- 
tains cycles while the position-based formation control is able to globally stabilize 
the desired formation. This is because Property 2.1 is satisfied for the position- 
based formation control but not for the distance-based formation control. Because 
the position-based formation control is globally stabilizable, it has been applied 
to different cooperative control problems, including formation control of unicycles 
[83,41]. 

• Desired Formation Specification. 

The difference in the equilibria sets (2.50) and (2.51) also results in different 
specifications of the desired formation. For example, consider four planar agents in 
Fig. 2.16. The coordinate £ is a global frame while Ri,i = 1, • • • ,4, is agent i's local 
frame, possibly different from E. The desired formation is an equilateral square 
shown in Fig. 2.16. To specify a desired formation using (2.50), one only needs to 
determine the desired relative distances, which is invariant in different frames. This 
implies that the desired distances can be specified in either E or Rj's. However, for 
position-based formation control, the desired relative position z d must be prescribed 
in one common frame, such as E in Fig. 2.16. 

One subtlety in specifying a desired formation is how to guarantee a unique de- 
sired formation. For position-based formation control, specifying desired relative 
positions ofN—\ links is sufficient if these N—l links can form a connected graph. 
This is because once the desired relative positions of these N —\ links are fixed, the 
desired relative positions between any two agents are also fixed. As an illustration, 
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consider Fig. 2.16 and suppose that we specify the relative positions for the links 
between agents 1 and 2, between agents 2 and 3, and between agents 3 and 4. These 
three links form a connected graph and thus all the other relative positions, such 
as relative positions between agents 4 and 1, between agents 2 and 4, are uniquely 
determined. However, for distance-based formation control, specifying the desired 
lengths of N — 1 links may not enough to ensure a unique formation, as we already 
illustrated in Example 2.5. 
• Implementation of Control Laws. 

In practice, the relative position Zk's are obtained in each agent's local frame. If 
precise global frame information is available, the agents may transform the local 
measurements of Zk to the global coordinates for implementation. However, in some 
applications, such as space interferometry sensing, the global frame information 
may be imprecise or unavailable. In this case, we show that distance-based forma- 
tion control can be easily implemented without knowledge of any global frame in- 
formation while position-based formation control requires the knowledge of a com- 
mon frame in which the desired relative positions z d are specified. 

For illustration, we assume double integrator dynamics for the agents and rewrite 
the control laws for distance-based and position-based formation control as 

I 

position-based: xi = T; = — kfXi — ^dtk(Zk — zf) (2.117) 

i=\ 

t |z I 1 

distance-based: x\ = T,- = — kfXj — V dn c log( — )-. — -Zk (2.118) 

f~[ ' d k \z k \ 

where we take v(t) = 0„. Suppose that (2.1 17)-(2.1 18) are written in a global frame 
E. Then zf's must be specified in E. When E is not available, each agent imple- 
ments 't,, the Tj vector represented in agent i's frame 2 . Let Rj be the agent i's frame 
represented in E. Then ! T,'s are given by 



position-based: 't,- = -kfij x t - £ d ik {Rj z k - RJ4) (2.1 19) 

distance-based: % = -kiRjxi- T d ik log( l -^-) r -rRjzk- (2.120) 

■rj d k \zk\ 

It then becomes evident that both (2. 1 19) and (2. 120) require agent i's velocity rep- 
resented in Ri (i.e., the term Rfxi) and the relative position Zk represented in R{ 
(i.e., the term Rfzk)- In addition, the position-based formation control also needs 
Rfzt, which cannot be computed if the global frame E is not known. Thus, we con- 
clude that distance-based formation control is more suitable for applications with 
no global frame information. 



2 For vector representations in different frames, we refer readers to Appendix B.12.1. 
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2.9 Summary 

In this chapter, we employed passivity as a design tool for a class of group co- 
ordination problems where the topology of information exchange between agents 
is bidirectional. We exploited the symmetry inherent in the undirected graph and 
represented it as a passivity-preserving structure (pre-multiplication of D T ® I p and 
post-multiplication by its transpose as in Fig. 2.2). We used this structure to develop 
a passivity-based design framework that yields a broad class of decentralized and 
scalable cooperative control laws for complex and heterogeneous agent dynamics. 
In addition to stabilizing feedback rules, the passivity-based design framework con- 
structs a Lur'e-type Lyapunov function. As we will illustrate in Chapters 3, 4 and 
6, this Lyapunov function serves as a starting point for several adaptive designs that 
enhance robustness of group motion. 

We next applied the passivity-based design framework to agreement problems. 
We developed a class of decentralized protocols that achieve agreement of agents. 
We also studied the position-based and the distance-based formation control. For 
the position-based formation control, we showed that it can be transformed to an 
agreement problem, which means that the desired formation is guaranteed. In the 
distance-based formation control, we showed that the desired formation is only lo- 
cally asymptotically stable for cyclic graphs because Property 2.1 fails. We then 
proved the instability of the undesired formations for a three-agent example and 
concluded generic convergence to the desired formation. We also discussed how to 
specify a unique and feasible formation shape. Finally, a comparison between the 
position-based and the distance-based formation control was presented. 



2.10 Notes and Related Literature 

• The use of Schur decomposition in Example 2.3 follows [47, Theorem 3]. 

• The passivity-based framework in this chapter was developed in [5]. 

• Related Literature on agreement and formation control: A rapidly-growing lit- 
erature has been witnessed in the field of agreement. See e.g., [102, 109] for a 
summary. Applications of formation control can be found in the survey papers 
[117, 118, 26, 93]. Reference [98] first applied potential function method to the 
formation control with undirected information topology. A flocking algorithm was 
studied in [131] under time-varying communication graphs. In [47], the formation of 
multiple vehicles with linear identical dynamics was investigated. Based on a decen- 
tralized simultaneous estimation and control framework, the authors in [150] studied 
formation control using geometric moments. In [44], the position-based formation 
control was formulated as an optimization problem and a distributed receding hori- 
zon controller was proposed. Reference [123] considered optimal formation control 
of linear agent dynamics by using relative position and communicating estimates. 
In [124], a parallel estimator was developed for controlling formation of linear dy- 
namical agents with directed graphs. Reference [152] employed Jacobi shape theory 
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to decouple translational formation dynamics from shape and orientation dynamics. 
The proposed cooperative control laws locally stabilize the desired formation shape. 
Formation control with directed graphs has also been investigated in [22, 23, 55, 4]. 

• For directed graphs, significant results have been obtained using a number of dif- 
ferent approaches, such as the use of Laplacian properties for directed graphs in 
[103, 47, 109, 79], input-to-state stability [135], passive decomposition of group 
dynamics [76], eigenvalue structure of circulant matrices [87], set- valued Lyapunov 
theory in [90], and contraction analysis [31, 32]. In particular, recent research in 
[29, 28] also employed passivity as a tool for agreement of nonlinear systems. The 
results [29, 28] are applicable to strongly connected directed graphs for relative de- 
gree one agents. The passivity-based framework in this book allows agent dynamics 
to be relative degree higher than one for undirected graphs. 

• Step 1 in the passivity-based framework may not be applicable to certain classes 
of dynamical systems, such as nonholonomic agents. Significant research has been 
conducted when the agents are modeled as unicycles. In [77], the authors consid- 
ered a group of unit speed unicycles and proposed designs to achieve different group 
formations. A leader-following approach was introduced in [48] to ensure a de- 
sired group formation, where each unicycle maintains desired relative bearings and 
distances with respect to its neighbors. The control algorithms were based input- 
output linearization. Reference [88] studied cooperative formation control of mul- 
tiple unicycles by assigning each agent a desired trajectory to track. The tracking 
errors decrease as feedback gains increase. In [83], formation control of unicycles 
was studied in the position-based formulation and necessary and sufficient graphical 
conditions were obtained. Reference [87] employed eigenvalue structure of circu- 
lant matrices in cyclic pursuit formation. Agreement of positions and orientations of 
unicycles was considered in [37] and discontinuous time-invariant control laws were 
analyzed using nonsmooth analysis. For dynamical nonholonomic agents, backstep- 
ping is a useful tool to transform coordination laws from kinematic level to dynamic 
level [42, 39]. In [50], formation control with general agent dynamics was formu- 
lated as a nonlinear output regulation problem. 



Chapter 3 

Adaptive Design for Reference Velocity 

Recovery: Internal Model Approach 



3.1 Introduction 

The passivity-based design in Chapter 2 assumed that the reference velocity of the 
group is available to each agent and developed control laws that made use of this 
information. A more realistic situation is when a leader, say agent 1, in the group 
possesses this information. In this chapter, we exploit the design flexibility offered 
by the passivity-based framework and develop adaptive designs with which the other 
agents reconstruct reference velocity information. We first illustrate with an exam- 
ple that if the agents do not have the same reference velocity information, objectives 
Al and A2 cannot be guaranteed. Assuming that the reference velocity is constant or 
periodically time-varying, we then propose a basic adaptive design that allows the 
other agents estimate the reference velocity. This basic adaptive design preserves the 
passivity properties proven in Theorem 2.1 and recovers objective A2. The deriva- 
tion of this adaptive design follows techniques from the regulation problem studied 
in [21, 19, 64] and is referred to as the internal model approach because this design 
contains a model of the dynamic structure of the reference velocity. 

With the basic adaptive design, we next present an example which shows that 
the estimates of the reference velocity may not converge to their true value when 
the reference velocity is time-varying. This means that tracking of the reference 
velocity (objective Al) is not guaranteed. However, the basic adaptive design guar- 
antees tracking of the reference velocity in several special cases, such as, when the 
reference velocity is constant, or when the desired target set is the origin. 

In the situation where tracking of reference velocity fails for the basic adaptive 
design, we propose an augmented adaptive design to guarantee tracking of the ref- 
erence velocity without the restrictions discussed above. The main idea in the aug- 
mented design is to ensure that the relative velocities between agents converge to 
zero, thereby guaranteeing that all agents converge to the reference velocity. By in- 
cluding the relative velocity feedback in the redesign, we recover the stability result 
of the basic adaptive design while achieving tracking of the reference velocity. 
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3 Adaptive Design for Reference Velocity Recovery: Internal Model Approach 



3.2 Why Adaptation? 



The design (2.8), (2.1 1) and (2.23) assumes that the reference velocity v(t) is avail- 
able to each agent. In practice, the reference velocity information may be available 
only to a leader, say, the first agent, while the other agents may have incorrect v(t) 
information. We then let v,-(f) be the reference velocity of agent i,i— 1, • • • ,N, where 
Vi (t) = v(t) and v,(?) ^ v(t), Vi 7^ 1, which means that agents i = 2, • • • ,N, have in- 
correct reference velocity information while agent 1 has the correct one. We now 
consider the example in (2.69) and examine the performance of the group motion 
due to different v,(f)'s. 

For simplicity, we assume that nij — 1, Pk{zk) satisfies (2.71) with 8k — 1, and 
v,(f)'s are constant. Using (2.69), we obtain the closed-loop system 



where v 



,J~\T 



)"AfJ 



x = -k(x -v)-(D®I p )(z- 1 
Denoting 



(3.1) 



Vi = vt 



N 






and v 



[v~iV 



• v T ^ T 



, we obtain from (3.1) that 



x = —k 



L-l N ®f t ~v i )-(D®I p )(z-z? i )- 



kv. 



Using the transformation 

x(f) :=x(t)-x c - 

where x c is as in (2.52), and recalling (2.54), we rewrite (3.3) as 

x + £x+ (L(g)/p)x = kv. 




(3.2) 



(3.3) 



(3.4) 



(3.5) 



According to the result in Section 2.6.1, if kv = 0, system (3.5) has a globally 
asymptotically stable equilibrium at x = and z = 0. Thus, nonzero kv can be 
considered as a constant disturbance to (3.5). This constant disturbance shifts the 
equilibrium of (3.5). To find out this shifted equilibrium, we rewrite (3.5) as 



-kI N -D 
D T t 



and compute the equilibrium of (3.6) by solving 



-kI N -D 
D T f 



(3.6) 



(3.7) 
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which leads to 

(D T (g>I p )x = (3.8) 

and 

-kx+(D®I p )z = -kv. (3.9) 

Using the fact that 1# spans jV[D T ') and noting from (3.2) that 

(l N ®I p ) T (-kv) = 0, (3.10) 

we obtain from (3.9) that 

(l N ®I p ) T {-kk+(D®I p )z) = -k(l N ®I p ) T ± = (l N ®I p ) T (-kv) =0. (3.11) 

Because (3.8) implies that x = 1# (g) c for some c € W , it follows from (3.9) and 
(3.11) that 

x = (3.12) 

and 

(D®I p )z=-kv. (3.13) 

Since (3.10) implies that — kv _L ,yV(D T ®I p ), —kv lies in M{D®I p ), which means 
that there exists a nonzero z satisfying (D®I p )z = — kv. Then the equilibrium of 
(3.6) is given by 

x = and z = z. (3.14) 

We note from (3.4) that x = means that the velocities of all the agents converge 
to j, Y4L1 v i rather than the correct reference velocity v. The steady state of z is also 
shifted to z 7^ 0, which implies that the desired target z = zf 1 cannot be reached. 

From this simple example, we see that if the agents do not share the same ref- 
erence velocity information, steady state errors arise in both the reference velocity 
tracking and the convergence to the desired target sets. In the following sections, 
we will propose adaptive designs with which the agents recover the reference veloc- 
ity information and eliminate the steady state errors. In particular, we will present 
two approaches: the Internal Model approach and the Parameterization approach. 
The internal model approach assumes the reference velocity to be constant or peri- 
odically time-varying while the parameterization approach developed in Chapter 4 
parameterizes the reference velocity as a linear combination of known time-varying 
basis functions with unknown constant coefficients. 



3.3 Internal Model Approach: The Basic Design 

Let v n (t) € R be the nth element of the reference velocity v(t) € R'\ n= 1, • • • ,p. 
In this section, we consider the situation where v"(f), n = 1,« •• ,p, are generated by 
the following exogenous systems: 
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if =Ar] n 
v"(t) =H"ri", 



(3.15) 
(3.16) 



in which A e Ri x i satisfies A = -A T , 77" e R 9 , 77" (0) is arbitrary and H" e M lx <?. 
The skew symmetry of A implies that the eigenvalues of A lie on the imaginary 
axis and thus v(t ) can be constant or periodically time-varying. We further make the 
following assumptions: 

Assumption 1 The matrix A is available to all the agents. □ 

Assumption 1 is reasonable in the situation where the general dynamic structure 
of the reference velocity is known, but the observation matrix H" and the initial 
conditions of 77" (0) may be chosen by the leader. This is relevant, for example, 
when the leader makes decisions autonomously in real time. 

Assumption 2 (A,H") is observable, n — 1 , • • • , p. □ 

Assumption 2 is standard. Denoting 



( H " \ 
H n A 



£m = 



vqxq 



n = 1, • 



\H n A c <- 1 j 



and 



^ H = diag{^ //1 ,---,^ // p}, 



we note that Assumption 2 is equivalent to J?// being full rank. 
For compactness, we introduce 



n 



[(V) r ,-,W) r ] T eR« 



and 



(/„ ® A) G 



xpq 



// = diag{H 1 , ■■■,H"}e'. 



IX pq 



and rewrite v(t) from (3.15) and (3.16) as 



77 = A77 
v(t) =Hr\. 



(3.17) 



(3.18) 



(3.19) 



(3.20) 



(3.21) 
(3.22) 



The first step in the basic adaptive design is to modify the internal feedback loop 
to assign an estimate of v(t) to agent i. To this end, we modify (2.8) as 



xi =yi+v(r) 
xi =yi + n(t) i 



,N, 



(3.23) 
(3.24) 



where v, is agent i's estimate of v(t). As demonstrated in Section 3.4, such a modi- 
fication can be easily achieved based on the design in Step 1 in Chapter 2. 



3.3 Internal Model Approach: The Basic Design 



55 



We next make use of the feedback (2.23) and design an update law for v,. Let 
v" and uf, n= 1, • • • ,p, be the nth element of v, and m, respectively. For agent i, 

i = 2, ■■■ ,N, we design 



n=B n i®h n=l,-,q, 



(3.25) 
(3.26) 



where GTf € R 1 ?, GJf(O) is arbitrary, and fi? G K 1 xc/ . Note that the design (3.25)-(3.26) 
contains the model information A of the reference velocity. The main restriction of 
this design is that (A,Bf) be observable, that is, 



B"A 



£r" = 



yqxq 



be full rank. 



(3.27) 



Letting 

tBt=[(m}) T ,-,(mf) T ] T eRM and B i = dmg{Bl- 

we obtain the compact form of (3.25) and (3.26) as 

CJ, = AUJi + Bf ui 
\'i = BitS{. 



,8f}eK" xw , (3.28) 



(3.29) 
(3.30) 



The block diagram of (3.29)-(3.30) is shown in Fig. 3.1, which resembles the 
passivity-preserving Structure 1 in Section 1.5. In fact, because A — — A T , using 
the storage function ifiJ^GT,-, we prove the passivity from Bjiij to GJ, in Fig. 3.1. 
According to Structure 1, we obtain the following passivity result: 



Lemma 3.1. The system in (3.29)-(3.30) is passive from m,- to v,(?). 



□ 



Lemma 3.1 is essential in establishing the closed-loop stability of the basic adaptive 
design in Theorem 3.1. 




Fig. 3.1 The blockdiagram of (3.29)-(3.30) resembles the passivity-preserving Structure 1 in Sec- 
tion 1.5. The passivity from u\ to v; can be established by using the storage function \wj W[. 



The goal of the adaptive design is to recover the convergence results \xi — v{t) \ — > 
and u = -(D(g>I p )y(z) — > in Theorem 2.1. Note from (3.24) that \x t - v(t)\ -> 
0, Vi = 2,' ■ ■ ,N, is ensured by y,- — » and |v,(r) — v(t)\ — > 0. Lemma 3.2 below 
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employs the observability conditions of (A,H n ) and (A,B") and proves that when 
u = 0, (3.29)-(3.30) has a unique solution GJ,(?) such that v,(f) in (3.30) is equal 
to v(t) in (3.22). This allows us to transform the convergence |v,(?) — v(t)\ — > to 
\Wi-Wi\^0. 

Lemma 3.2. Define 

Z? = £ B ,} £ H » , n=l,---,p, i=l, — ,N, (3.31) 

where J3h" is in (3.17) and J3s n is in (3.27). Let 

r, = diag{i:, 1 ,...,2:, p }, (3.32) 

and 

W l (t)=E i r}(t) (3.33) 

where 77 (?) is as in (3.21). Then, when Uj = 0, &i{t) is the unique solution to (3.29)- 
(3.30) such that v,(f) = v(t), where v(t) is as in (3.22). □ 

We next define 

m{t) = [(joh - ® 2 ) T , • • • , (a N - m N ) T ] T (3.34) 

where GJ,- is in (3.33), and note that the equilibria set of (3.21)-(3.24), (3.29) and 
(3.30) translates to 

S* = {(z,§,»)| (§, z) € <? and ro = 0} (3.35) 

where <? is as in (2.33). This equilibria set includes the desired equilibria set 

^* = {(z,|,ft)| (§, z)e^andW = 0}, (3.36) 

where si is in (2.36). Theorem 3.1 below makes use of the passivity property 
in Lemma 3.1 and proves the stability of &/*. It also shows that the trajectories 
(E, (t) , z(t)) converge to $ . Whether the convergence G7 — > is achieved or not de- 
pends on the properties of the reference velocity, the desired target sets srf\ and the 
potential functions Pk{zk)- We will elaborate on this after Theorem 3.1. 

Theorem 3.1. Consider the coordination laws in (3.23) and (3.24) where v(t) and 
v,(?) are given by (3.21)-(3.22) and (3.29)-(3.30), in which A is in (3.20) and A = 
—A T , Ui is defined in (2.23) in which Xjf^, k — !,■■■,£, are designed as in (2.27)- 
(2.31), andyi is the output of J^ with the input Ui, i — 1, • ■ ■ ,N, in which the passive 
block Jtf is designed as in (2.11)-(2.15). Suppose that (3.27) holds. Then, 

i) The desired equilibria set srf* in (3.36) is stable; 

ii) All trajectories (z(t),^(t),fil(t)) starting in S? X Rw -1 ) are bounded and 
converge to S x M^-i), wnere g j s as ,„ (2.33); 

Hi) If Property 2.1 holds, all trajectories (z(t) ,^(t), 07(f)) starting in <$ x W'( N ~ ') 
converge to #f x W^ N ~ r K D 
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Proof (Lemma 3.2). When m, = 0, (3.29)-(3.30) reduces to 

G7, = AWi 
Vj{t) = BtWi(t). 

Note that the definition of Ef in (3.31) leads to 



BIAZ? 



BfA^Ef 
\ B?A«Ef ) 



( H" \ 
H"A 



\ H"Ai J 



which implies 
and 



B"A 



\B'^A c i- 1 ) 



AT," 



B"E" = H n 



( H "- \ 

H n A 



\H n Ai- 1 J 



A^AE" = E"A 



i i - 



B B n Bjjn 

It then follows from (3.20), (3.28) and (3.32) that 



and 



AEi = E t A. 
Using (3.21), (3.22), (3.33), (3.42) and (3.43), we obtain 

GJ, = AtD 



and 



B i m i = Hr\=v{t) 1 
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(3.37) 
(3.38) 



(3.39) 



(3.40) 



(3.41) 



(3.42) 
(3.43) 

(3.44) 
(3.45) 



which ensure that GJ, is a solution to (3.37) such that v,(f) in (3.38) is equal to v(t) 
in (3.22). 

We now prove the uniqueness of GJ, by contradiction. Suppose that there exists 
another solution GJ,(?) to (3.37) so that v,-(f) = v(r). Then let GJ<. ( = GJ,- — GJ, and note 
that 

m ej =Am ei (3.46) 



and 



fi,GL = 0. 



(3.47) 
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It follows from the observability of (A,B") that (A,Bj) is observable, which implies 
from (3.47) that GT e . = 0. Thus, GJ,- must be unique. D 

Proof (Theorem 3.1). We consider the storage functions VAz) and Vb(%) in (2.35) 
and the following storage function 



Vp(d7) = -m T U5. (3.48) 



To compute the time derivative of V/(z), we define 

v i (t) = v i {t)-v(t), i = 2,---,N, (3.49) 

and note from (3.22), (3.30), (3.33) and (3.42) that 

v i {t)=B i {m i {t)-Z i r]{t))=B i & i . (3.50) 

For consistency, we let v\ (t) = p . We obtain from (2.6), (3.23) and (3.24) 

z= {D T ®I p )x (3.51) 

= {D T ®l p ){\ N ®v(t)+y + v} (3.52) 



where 
Noting the fact 



v=[v T u vl,---,v T N ] T . (3.53) 



(D T ®I p )(l N ®v(t))=0, (3.54) 

which follows from the third item in Property 1.5, we rewrite (3.52) as 

z=(D T ®I p )(y + v). (3.55) 

We use (3.55) and compute Vf as 

V f = \i/ T (D T ®I p )(y + v) 

= {(D®I p )y,} T (y + v) 

= -u T (y + v). (3.56) 

The time derivative of Vj,(<f; ) is the same as (2.40). 

Having computed Vf(z) and %(%), we now proceed to computing V p (W). From 
(3.34), (3.29), (3.30), (3.33), (3.21), (3.22) and (3.43), we obtain 

Gfi = AWi + Bfui - EiAri (3.57) 

= A{®i-Lii])+B]ui (3.58) 

= AWi+Bjui (3.59) 

which, together with the passivity result in Lemma 3.1 and (3.50), results in 
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N 
V p = 1 £ujB i ®i (3.60) 

i=2 

= u T v. (3.61) 

Thus, the time derivative of V — Vf + Vb + V p yields 

? = -Ew,(&)-£«frt<o (3-62) 

which proves the stability of (3.36). Because the closed-loop system (2.11), (3.55) 
and (3.59) is time invariant, we apply the Invariance Principle and analyze the largest 
invariant set Jt where V — 0. It then follows that m — 0, V/ ^ <f and that t,i = 0, V7 e 
J* . We further note from (2.12) that |,- = 0, i e / implies m,- = 0, which, together 
with ut = 0, Vi ^ ^, proves the convergence of the trajectories (z(t),^ (t),GS(t)) to 
S x WP( N ~ l \ where # is as in (2.33). In particular, when Property 2.1 holds, u = 
means zj£4 and thus all trajectories (z(t), t;(t),G}(t)) starting in & x 1 W ' A ' _1 ' 
converge to the set srf x W*( N ~ ^ . 

The proof of Theorem 3.1 shows that the trajectories (z(t) , !; (t) , GS (t)) converge 
to an invariant set ^# where E, = and u = 0, which means that objective A2 is 
achieved at least locally and that \x\ — v{t) \ — > and \xi — vt(t) \ — * 0, i = 2, • • • ,iV. If, 
in addition, v,- converges to v(t), the agents all reach the reference velocity, thereby 
guaranteeing objective Al. The next corollary characterizes two possible situations 
where objective Al is ensured. 

Corollary 3.1. Suppose that the assumptions in Theorem 3.1 hold. If, in addition, 
one of the following conditions holds: 

1. A in (3.15) is a zero matrix; 

2. Property 2. 1 holds and ^4 is a point, V£ = 1 , • • • , I, 

then all trajectories (z(t),£, (t), 6f(t)) converge to the equilibria set S* in (3.35). In 
particular, & — *Qand |v,-(f)— v(t)\ — > 0, (' = 2,- •• ,N. Moreover, if condition 2 holds, 
the set &/* in (3.36) is uniformly asymptotically stable with the region of attraction 

If condition 1 is satisfied in Lemma 3.1, the reference velocity in (3.21)-(3.22) is 
constant and vt(t) then acts as an integral control that eliminates the constant distur- 
bance due to the different reference velocity information. An example of condition 
2 is the agreement problem, where ^ — and Property 2. 1 is satisfied due to (2.46). 
Note that condition 2 restricts the desired target sets gfy but not v(t) while condition 
1 restricts v(t) instead of ^4. 

Proof (Corollary 3.1). Noting the results in Theorem 3.1, we only need to prove 
ft) —> and | \>j(t ) — v(t) \ — > 0, i = 2, • • • ,N for both conditions. For condition 1, v(t) 
in (3.22) is constant since A in (3.21) is a zero matrix. Because «-0on ^C, we 
conclude from (3.29) and (3.30) that v,(?) is constant on .-#. For dynamic block M'i, 
we obtain from £,■ = 0, m, = 0, (2.11) and (2.13) that v, = 0. Likewise, the static 
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block satisfies >>,■ = hi(uj), which implies that yi vanishes on M. ' . Using (3.23) and 
(3.24), we have x\ = v and x,- = v,-, where v and v; are constant. Since z — (D T (g) I p )x, 
z is also constant and thus the only way that z(t) can stay in the bounded invariant 
set j% is when z — 0. We then note from (3.55) that (D T ® I p )v = 0. Recalling that 
vi = 0, we conclude v = and | v,- — v(t) \ — > 0, i = 2, • • • ,N. The convergence (3^0 
follows from the uniqueness property in Lemma 3.2. 

For condition 2, note that since Property 2. 1 is satisfied, z is constant on ,,#. Thus, 
z = 0. Following an analysis similar to that for condition 1, we obtain |v, — v(t) | — > 0, 
j = 2,---,ATandffl— >0. D 



3.4 Design Examples for Distance-based Formation Control 

In this section, we illustrate two examples of the basic adaptive design: The first one 
assumes a constant v(t), and the second one shows that for a time-varying v{t), v,- 
may fail to converge to v(t), which motivates us to redesign the feedback (2.23) in 
Section 3.5. 

We consider the formation control of three agents, where agent 1 is the leader, 
which means that v(t) is available only to agent 1. In this example, we assume that 
the agent model is given by (2.16) with m,- = \,i= 1,2,3. The desired formation is 
an equilateral triangle with side length 1 . 

The dynamics of agent 1 remain the same as in (2.19) and (2.20). For the agents 
i ~ 2, 3, T,- in (2.17) must be replaced with 

x t = -ki(xj(t)-Vi(t))+mJi(t) + Ui, (3.63) 

which, together with a change of variables ^ = i, — v,, brings the dynamics of the 
agents, i = 2,3 to the form 

Xi = %i + Vi (3.64) 

m,-|,- = -kit,, ■■ + ui (3.65) 

where Uj is as in (2.23), yk{zk) is as m (2.87) and Gk{) is taken as the natural 
logarithm. The signal v, is available for implementation in (3.64) once the update 
law in (3.29)-(3.30) is setup. The graph G is the same as in Section 2.6.2. 



3.4.1 Constant Reference Velocity 

We select agent 1 to possess a constant reference velocity v = [0.2 0.2] r . It follows 
that A in (3.21) is 2 . We choose 77(0) = [0.2 0.2] r and H = I 2 in (3.21)-(3.22). 
Agents 2 and 3 implement (3.63) and update v,- according to (3.29)-(3.30) with 
A = O2 and Bj = H. In Fig. 3.2, three agents first start with the adaptation turned 
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off. Since agents 2 and 3 possess incorrect information about v(t ) and since there is 
no adaptation, the relative distances \zk\ do not converge to the prescribed sets ^4. 
However, we note that |z&|'s remain bounded because the interaction forces Yk(Zk) 
act as "spring forces" between neighboring agents and prevent them from diverg- 
ing. At t — 10 sec, we turn on the adaptation for agents 2 and 3, which results in 
convergence to the desired distances \zk\ = 1 asymptotically. At the same time, Fig. 
3.3 illustrates that the estimated velocities vj and vf converge to the corresponding 
reference velocities v[ and v\, as guaranteed by Corollary 3.1. 



3.4.2 Motivating Example for the Augmented Design 

In this example, v(t) in the jfj-dynamics (2.19) is chosen to be time-varying and 
satisfies (3.21)-(3.22) with 

A = h®( °,l) (3.66) 



and 



1 



H-iC"°,?V (3-67) 



2 V 00-1 1 



We pick the initial condition of 77 as 77(0) = [^ — 3 ~ X ^] r - It is eas Y to 
verify that v(t) = [ — 3 sin(f) -^ cos(t)] T , which implies that x\ will rotate with a 

radius of -^. The other agents estimate v(t) by (3.29)-(3.30) with A in (3.66) and 
B t =H in (3.67), 1 = 2,3. 

To show the lack of estimate convergence, we suppose that initially the three 
agents form an equilateral triangle where x\(0) = [-^ 0] T , #2(0) = [—/r \] T and 

.13(0) = [~ g ~ j] r > thus satisfying the desired formation with \zk\ = \,k= 1,2,3. 
Fig. 3.4(a) shows that in the nonadaptive design where the reference velocity v(t) is 
available to each agent, the group exhibits a translational motion with x\ spinning 
around the origin. The adaptive case in Fig. 3.4(b) where the initial conditions are 
setto^(0) = &(0) = fe(0)=0,CJ 2 (0) = [-i-f f -\ -i + f \-% T 
andOT 3 (0) = [i-^ i^ + i ^-i _v^_i]r shows that the agents i = 2,3, 
exhibit a rotational motion about the leader, which means that the v(t ) information is 
not fully recovered for agents 2 and 3. However, the agents still maintain the group 
formation as expected from Theorem 3.1. 

To recover velocity tracking of a time- varying v(t), one choice is to modify the 
sets ^4 to the formulation of the position-based formation control in (2.51). Since 
position-based formation control can be transformed to the agreement problem, con- 
dition 2 in Corollary 3.1 is satisfied and thus |v,(f) — v(t)\ — > is guaranteed. Fig. 
3.5 shows that the basic adaptive design achieves the same group behavior as in the 
nonadaptive design Fig. 3.4(a), by taking the desired target sets to be the same as 
(2.81) with the choice of the interaction forces 
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Fig. 3.2 Snapshots of the formation in the adaptive design with constant v(f): In the first 10 sec- 
onds, the adaptation is off and the desired formation is not achieved. The adaptation is turned on 
at t = 10 sec, after which point the trajectories converge to the desired formation. [11]. Reprinted 
with Permission. Copyright Elsevier 2008. 



Vk{zk) = Zk-z d k - 

In the next section, we show that full recovery of the v(t) information is ensured 
by augmenting m, with relative velocity information. This augmented design does 
not need to modify the desired target set s^. 
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Fig. 3.3 Velocity convergence with the adaptive scheme in Fig. 3.2. [11]. Reprinted with Permis- 
sion. Copyright Elsevier 2008. 



3.5 The Augmented Design 



We now develop an augmented design that guarantees tracking of the reference 
velocity v(t). In the basic update law (3.29)-(3.30), v, stops updating when «,- in 
(2.23) reaches zero. As shown in Section 3.4.2, however, m, = does not mean that 
all the agents possess the same velocity v(t). Note that, in contrast, z — » would 
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Fig. 3.4 Two group behaviors: (a) Nonadaptive design with time-varying v(t): The group exhibits 
a translational motion with X\ spinning around the origin, (b) Adaptive design with time- varying 
v(t ) : The agents xi and x^ exhibit a rotational motion about the leader x\ . 



imply that all agents converge to the same velocity. Thus, in the augmented adaptive 
design we employ z to guarantee tracking of the reference velocity. 
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Fig. 3.5 Adaptive design with modified desired sets. The group exhibits the same translational 
motion as in the nonadaptive design Fig. 3.4(a). 



To present the augmented design, we introduce a static directed graph G v repre- 
senting the information topology for the relative velocity: If the /th agent has access 
to the relative velocity information i, — Xj, then the nodes i and j in the graph G v are 
connected by a directional link from j to i and agent j is a neighbor of agent ;'. We 
denote by ,jV { v the set of neighbors of agent i in G v , We assume that G v is balanced 
and strongly connected. 

Instead of the external feedback in (2.23), we now propose the augmented design 



- £ <fo V*(z*) _ H {xi-Xj) 



(3.68) 



with the update law (3.29)-(3.30). This augmented design recovers the stability re- 
sult of Theorem 2. 1 and ensures tracking of the reference velocity. 

Theorem 3.2. Consider the coordination laws in (3.23) and (3.24) which are de- 
signed the same as in Theorem 3.1 except that Uj is now defined in (3.68). Assume 
that G v is static, strongly connected and balanced. Then, the desired equilibria set 
£/* in (3.36) is stable. All trajectories {z(t),E, (t),(D(t)) starting in <S x RW^-l) are 
bounded and converge to the equilibria set §* in (3.35). If, in addition, Property 2.1 



holds, all trajectories (z(t), £, (t),W(t)) starting in ' 
(3.36). 



-i) 



converge to , 



in 

□ 



Proof. Using (1.11), (3.24), and (3.49), we rewrite (3.68) in the compact form 

u=-{D®I p )y~{U®I p ){y + v) (3.69) 
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where U is the graph Laplacian matrix of the velocity graph G 1 and we have used 
V'lpj — in Property 1.1. 

We take the same storage functions as in (2.35) and (3.48). With the new feedback 
(3.68), we note that % and V p remain the same as in (2.40) and (3.61) while Vf is 
now given by 

V f =(-u-{L v ®I p )(y + v)) T (y + v). (3.70) 

Thus, the time derivative of V = V p + Vf + V/ 7 is 

V = - £ Wi{&)- £ uJ yi -(y + v) T {L v ®I p )(y + v). (3.71) 

When G v is balanced and strongly connected, (y + v) (U ®I p )(y + v) is nonneg- 
ative (see Property 1.4). Thus, the derivative in (3.71) is negative semidefinite, 
which implies global stability of si* in (3.36) and boundedness of all the signals 

We next apply the Invariance Principle and investigate the largest invariant set 
J(' where V = 0. It follows that & = 0, V; e J, m = 0, Vi ^ </ and (y + v) T {U (g> 
Ip){y + v) = 0. Applying the same analysis as in the proof of Theorem 3.1, we 
conclude u = on ^#'. For dynamic block J^, we obtain from |,- = 0, ui = 0, (2.1 1) 
and (2.13) that y; = 0. Likewise, the static block satisfies y, = /!,(«,), which implies 
that y,- vanishes on ./#'. Therefore, y, =0 on ^#', V(. Recall from Property 1.4 
that (y + v) r (L v (g)/p)(y + v) is zero only when (y + v) T (Ll )(y + v) is zero. Since 
the graph G v is strongly connected, the graph corresponding to L\ ym is connected. 
Therefore, (y + v) T (L v <8)I p )(y + v) =0 implies y + v= ljy <8> c, where c € K p . We 
then conclude from vi = and yi = that |y + v| = 0. Since y = 0, it follows that 
v = 0, which implies from (3.49) that tracking of v(t) is achieved. Because of the 
uniqueness property in Lemma 3.2, dj — *• follows. □ 



3.5.1 Motivating Example Revisited 

We now include the relative velocity information in the external feedback m, in the 
adaptive design in Section 3.4.2. We assume that G v is a directed cyclic graph, where 
agent (' is the only neighbor of agent / + 1 for i = 1,2 and agent 3 is the only neighbor 
of agent 1. We modify m, in Section 3.4.2 by adding the relative velocity for each 
agent with respect to its neighbor. Fig. 3.6 shows the snapshots of the formation. The 
group now exhibits a translational motion with x\ circling around the origin, which 
means that the nonadaptive results are fully recovered. In addition, the estimates of 
the reference velocity converge to their true values as shown in Fig. 3.7. 
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Fig. 3.6 The augmented adaptive design (3.68) recovers the velocity tracking of the nonadaptive 
design as well as ensuring the desired formation. 



3.6 When There Is No Leader 



In the case where there is no leader in the group and no prescribed reference velocity 
v(t) is assigned to the group, we let each agent implement 



x, 



■■yt+Ht), 



i. 



,n, 



(3.72) 



where v,(f) is updated by (3.29)-(3.30). When ut in (3.29) is given by (2.23), Corol- 
lary 3.2 below proves the same convergence results as in Theorem 3.1. In particular, 
if condition 1 or 2 in Corollary 3.1 holds, the design (3.72) guarantees that i, con- 
verges to a common velocity v(t), Vi = 1, • • • ,N. When B, = B in (3.29) for some B 
matrix, Mi, this common velocity v(t) is characterized by 



X 

v{t) 



AX 
BX 



(3.73) 
(3.74) 



where;t(0) = ^E£iffl»(0). 



Corollary 3.2. Consider the coordination law in (3.72), where v,(f) is given by 
(3.29)-(3.30), w is defined in (2.23) in which Yk> k = 1»'"!^ are designed as 
in (2.27)-(2.31), and y,- is the output of ,^P[ with the input m,-, i — l,---,N, in 
which the passive block J 1 ^ is designed as in (2.11)-(2.15). Then, all trajecto- 
ries (z(t) , £, (t) , HI (t)) starting in 5f x M. pqN are bounded and converge to the set 
S x W ,C ' N , where G7 : 



07 



1 ! 



CTjy] and $ and <£ are as in (2.33) and (2.37). 
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Fig. 3.7 With the augmented adaptive design (3.68), the agents' estimates converge to the true 
reference velocity v(t). 



When Property 2.1 holds, all trajectories (z(t),^(t),SS(t)) starting in & x W N 
converge to the set stf x M. pq , where s$ is as in (2.36). 

When condition I or 2 in Corollary 3.1 holds, there exists a bounded v(t) G K p 
such that \\>i — v(t)\ — > 0, V7. In addition, if 5, = B in (3.29), Vz, then v(t) is as in 
(3. 73)-(3. 74) and \tB t (t) - % if) \ -»• 0. D 

Likewise, when w, is augmented with relative velocity information as in (3.68), 
i,'s reach agreement without requiring either condition in Corollary 3.1: 



3.6 When There Is No Leader 69 

Corollary 3.3. Consider the coordination law in (3. 72), which is designed the same 
as in Corollary 3.2 except that m,- is now defined in (3.68). Suppose that G v is 
constant, strongly connected and balanced. Then, all trajectories (z{t) , £, (t) , (D '(?)) 
starting in Sf x MP 9 are bounded, where Sf is in (2.37). The signals (z(t),^(t)) 
converge to the set S , where § is as in (2.33). "When Property 2.1 holds, (z(t),^(t)) 
converge to the set si, where si is as in (2.36). Moreover, |v; — v(t)\ — > for some 
bounded v(t) £ ]R P and for all i. IfBj = B in (3.29), \/i, then v(t) is as in (3. 73)-(3. 74) 
and \(Bf(t) - x(t)\ — ►(). 

The proofs for Corollary 3.2 and 3.3 are given in Appendix A. 1 and A.2, respec- 
tively. 



Chapter 4 

Adaptive Design for Reference Velocity 

Recovery: Parameterization Approach 



4.1 Introduction 

The designs in Sections 3.3 and 3.5 restrict the reference velocity v(t) to be constant 
or periodically time-varying. In this section, we present adaptive designs that are 
applicable to any time-varying, uniformly bounded and C 1 reference velocity v(t) 
that can be parameterized as 

v(0 = E &(?)& = (®(t) T ®I P )0 (4.1) 

where <p-'(t) £ R, j = 1, • • • ,r are basis functions available to each agent, 6> ; £ K p 
are column vectors available only to the leader, 

*(t) = [4>\t),-,4> r (f)] T (4.2) 

and 

e = [(e l ) T ,--- 1 (e r ) T ] T . (4.3) 

We let agent ;', i = 2, • • • ,N, estimate the unknown ; by B(, and construct v,(Y) from 

M0 = L^W5/ = (*W r ®^)ft i = 2,-~M (4-4) 

where 

4 = [(^) r ,-,(W- (4-5) 

In the following sections, we first develop a basic adaptive design with which 
agent i updates its estimate 0,-, / = 2, • • • ,N. Like the design in Section 3.3, this ba- 
sic adaptive design recovers objective A2 as well as guaranteeing objective Al in 
some special cases, such as, the agreement problem. To ensure objective Al when 
the basic adaptive design fails, we then modify the basic adaptive design in a similar 
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fashion to Section 3.5 and obtain the augmented adaptive design. We next apply the 
adaptive design result to an extremum seeking example. In this example, a group 
leader autonomously determines the Newton direction towards the extremum by 
sampling a field distribution and parameterizes the group reference velocity accord- 
ing to the Newton direction. The other agents then estimate this reference velocity 
using the basic adaptive design and reconstruct the desired formation during ex- 
tremum seeking. Before proceeding to these results, we first compare the parame- 
terization approach and the internal model approach. 

In the parameterization approach, the availability of the basis functions to each 
agent is similar to Assumption 1 in the internal model approach, where the A matrix 
is available to each agent. The use of the basis functions (j> J '(t) removes the re- 
striction in the internal model approach that v(t) be constant or periodic. The basis 
functions may then be used to shape the transient of the reference velocity profile. 
However, since ; (?)' s are time-dependent, the agents need to have synchronized 
clocks to implement this parameterization approach. The next example compares 
the number of the internal states used for estimating periodic reference velocity in 
these two approaches. 

Example 4.1. We consider a scalar reference velocity v(t), parameterized by 

v ( f ) = £(a/sin(w,-f) +6 ; cos(w,-f)). (4.6) 

i=i 

Note that v(t) in (4.6) is already parameterized by the basis functions sin(w,f ) and 
cos(w,f ), i = 1 , • • • , r\ . Therefore, the total number of unknown parameters that pa- 
rameterizes this v(t) is 2r\, which means that for the parameterization approach, 
each agent (except the leader) needs to update 2r\ internal states to estimate these 
unknown parameters. 

In the internal model approach, we choose A in (3.15) as 

— w\ \ ( — vt-v, 



A = diag lU o>-'U « )) (4 - 7) 

which implies that the dimension of GJ, in (3.29) is 1r\. This means that each agent 
(except the leader) also maintains 1r\ internal states to estimate the reference ve- 
locity. Thus, in estimating generic periodic reference velocities, both the parameter- 
ization approach and the internal model approach use the same number of internal 
states. 

In some special cases, however, the parameterization approach may require less 
internal states. For example, if in (4.6) b, = 0, i = 1 , • • • , n , then the parameterization 
approach only requires r\ internal states which estimate all a,'s while the internal 
model approach still requires A to be the same as (4.7) and the dimension of SJ,- to 
be2n. □ 
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4.2 The Basic Design 

We choose the update law for the parameter 0, in (4.4) as 

e i =A i {<P{t)®I p )u i (4.8) 

in which A, = Aj > and m, is as in (2.23). As proven in Theorem 4.1 below, the 
basic adaptive design (3.23), (3.24), and (4.8) guarantees convergence to the desired 
target sets (objective A2). Whether objective Al is achieved or not depends on the 
convergence of 0, to 0, which will be studied in Section 4.3. When 0, converges to 
0, v(t) is recovered with the adaptive design and, thus, object Al is also achieved. 

Theorem 4.1. Consider the coordination laws in (3.23), (3.24), (4.4) and (4.8) 
where v(t) is uniformly bounded and piecewise continuous, parameterized as (4.1) 
in which <j> J (t), j — 1, ■ ■ ■ ,r are uniformly bounded, and Jifj, i — 1, • • • ,N, and \j/h 
k = {,-■■,£ are designed as in (2.11)-(2.15) and (2.27)-(2.31), respectively. Then, 
the set 

<T = {(z,£, 0)|£=O, (D<E)I p )\i/(z)=Oandze^(D T <g)I p ), = 0*} (4.9) 

is stable, where = [0j , • • • , 9^} T and 6* = 1 w _ i <x> 0. All trajectories (z(t) ,%(t),@ (t)) 
starting in & x M. pr< - N ~ l > are bounded and converge to the set S x W r ( \ where 
S and *$ are as in (2.33) and (2.37). Moreover, when Property 2.1 holds, all trajec- 
tories {z{t),£, (t),6(t)) starting in & x W^ N -^ converge to the set srf x W r ( N ~ l \ 
where 8$ is as in (2.36). □ 

To obtain the closed-loop structure of the basic adaptive design, we denote by 0, 
the error variable 

0, = 0,-0 i = 2,---,N, (4.10) 

and note from (4.8) that 

& i =A i (#(t)®I p )ut. (4.H) 

Using (4.1) and (4.4), we get 

v i = v l -v{t) = {<J>{t) T ®I p )d il i = 2, — ,N. (4.12) 

We set 0i = and vi = 0, and define 

= [07,0 2 r ,---,0^] r (4.13) 

and 

v=(I N ®<P T (t)®I p )6 = [v{,---,vJ f ] T . (4.14) 

The closed-loop structure of the basic adaptive design is then shown in Fig. 4.1. 

We now give a passivity interpretation of the basic adaptive design. Because the 
single integrator is passive and because the feedback path from u to v exhibits the 
same structure as Structure 1, we obtain the passivity from u to v. We then conclude 
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Fig. 4.1 The closed-loop structure of the basic adaptive design. The appearance of <P{t) and its 
transpose before and after the integrator implies the passivity from u to v. The closed-loop stability 
follows from the interconnection of the passive feedforward path and the passive feedback paths. 



from the passivity result if) in Theorem 2.1 and Structure 2 that the feedback path 
is passive from u to y + v. As proven in Theorem 2.1, the feedforward path is also 
passive. Therefore, the closed-loop stability follows from Structure 3. The detailed 
proof is given below. 

Proof. To prove the stability of the closed-loop system described by the adaptive 
design (2.11), (3.55) and (4.11), we exploit the passivity properties of the intercon- 
nected systems and consider V/(z) and V/,(<^) in (2.35) and 



v a (e) = -Y j e?Ar i § i , 



(4.15) 



which are the storage functions for the three paths in Fig. 4. 1 . In particular, the time 
derivatives of Vf(z) and V/,(^) are the same as (3.56) and (2.40). 
Using (4.1 1), we obtain 

V a = j^6fAr% 



LOf A: 

i=2 


- l Ai(<P{t)®I p )ui 


N 
i—2 


{t)®Ip)Ui 


T ~ 
U V. 





(4.16) 
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From (2.40), (3.56) and (4.16), the Lyapunov function 

V(z,S,8)=V f (z) + V b ($)+V a (6) (4.17) 

yields the negative semidefinite derivative 

which implies that all the trajectories (z(t), 4 (t) , 6 (t )) are bounded. We further con- 
clude from Theorem B.5 in Appendix B.3 that £,- — > 0, V; e ^ and that «,- — > 0, 
V7 ^ >y. We next show that «,• — > 0, V; € ^. To this end we note that 

&=3-«f+3FW ( 4 - 19 ) 

is continuous and uniformly bounded because it and | are continuous functions of 
the bounded signals (z(t),t;(t),d(t),<P(t)) and because /((•,•) is C 1 . Since 4; — > 
and i,- is continuous and bounded, it follows from Theorem B.4 in Appendix B.3 
that 4; — * 0, which, from (2.1 1) and (2.12), guarantees m — > 0. 

Finally, we note that u — > implies from (2.26) that y(z) converges to the null 
space jY(D®Ip). This, in turn, implies that the trajectories (z(t),^ (t),0(t)) starting 
in Sf x RMtf-1) converge to the set <? x iM"" 1 ), where # and S? are as in (2.33) 
and (2.37). Moreover, when Property 2.1 holds, all trajectories converge to the set 
si x RM"-!), where si is as in (2.36). □ 



4.3 Parameter Convergence 

Parameter convergence is essential for recovering objective Al in Section 2.2 be- 
cause 6j — ► 9 implies |v,(f) — v(f)| — > 0. In this section, we restrict our attention 
to the group agreement problem as a special case of the adaptive design and show 
that the parameter convergence is achieved. We note, however, that the convergence 
to the desired target set (objective A2) is guaranteed by Theorem 4.1 even without 
parameter convergence. 

We assume that f%(Zk)'s are positive definite and radially unbounded functions 
on Sf^fc = W such that (2.46) is satisfied and thus, Property 2.1 holds. We further 
assume that the passive feedback block Jtf is in the control affine form 

4=/,fe)+S/(£k (4-20) 

yi = hi($i) (4.21) 

where 

h t {0) = 0, /,(0) = (4.22) 



76 4 Adaptive Design for Reference Velocity Recovery: Parameterization Approach 

and that the regressor <t>(t) in (4.4) is persistently exciting (PE), which means that 
for all t > 0, 

/ <P(t)<P(t) T dt>aI (4.23) 

Jto 

with some constants 8 > and a > that do not depend on t a . This PE condition 
ensures the information richness of the time-varying signal <P(t) throughout time, 
and guarantees parameter convergence: 

Theorem 4.2. In Theorem 4.1, suppose that the desired sets are jz4 = {0}, and 
that the passive feedback block is of the form (4.20)-(4.22). If<P(t) satisfies the PE 
condition (4.23), then the origin of (z,^,0) is globally uniformly asymptotically 
stable. In particular, &i — ► 8, i = 2, • • • ,N as t — ► °°. □ 

Proof. To prove parameter convergence in this case, we use the Nested Matrosov 
Theorem reviewed in Appendix B.5. The first auxiliary function V\ is the same as 
the Lyapunov function V in (4.17), which yields the negative semidefinite derivative 
in (4.18) and thus guarantees uniform global stability, that is, 

N 

7i=V<£-WK&):=Fi<0. (4.24) 

The second auxiliary function is 

V 2 = z T {D®I p ) + r y (4.25) 

where (D®I p ) + denotes the pseudoinverse of D&I p and 

r = diagi^A^O))- 1 ,- • • , (Z^MO))- 1 }. (4-26) 

In particular L gi hj(0) := A. I c._ gi(0) is nonsingular and thus invertible because 
of the passivity of the (^-subsystems in (4.20) and because of Proposition B.l in 
Appendix B.4. The derivative of V% yields 

V2=z T (D®I p )+ry + z T (D®I p ) + ry.= Y 2 (4.27) 

where we claim that 

Ti = =► Y 2 < 0. (4.28) 

To see this, note that Y\ = implies % = and it follows from (4.22) that y = 0, 
which means that the second term in V 2 vanishes. Because 3), = L gj hi(Q)ui when 
t, = 0, Y 2 becomes 

Y 2 = z T {D(g>I p ) + u. (4.29) 

Substituting (2.26) and z T = x T (D<g>I p ) from (2.6), we obtain 

Y 2 = -x T {D(g>I p )(D(g>I p ) + {D(g>I p )y(z) 
= -x T {D®I p )\j/(z) 
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= -z T w(z)<0. (4.30) 

Next we introduce the auxiliary function 

Vi = -((D T ®I p )v) T z (4.31) 

where v is defined in (4.14). Its derivative is 

y 3 = -{{D T ®I p )v) T z- ((D T ®I p )f) T z := Y 3 (4.32) 

and we claim 

Fl=0, F 2 = =► Y 3 =-{(D T ®I p )v} T {(D T ®I p )v}<0. (4.33) 

To show (4.33), we first note that Y 2 — implies that z T y(z) = and thus z = 
due to (2.46), which means that the second term in (4.32) vanishes. It follows from 
Y\ = that E, = and hence y is zero from (4.22). Therefore, z in (3.55) becomes 
(D T (g>I p )v, which proves (4.33). 

Finally, we define the auxiliary function 

V 4 = -6 T S(t)G (4.34) 

S(t) := / e^-^F{x)F{x) T dT F{t) := I N ®<P(t)(g>I p (4.35) 



where 



S(t) > f e^-^F{x)F{%) T dx > ae- S I 
because of the PE property of <P(t). Note that 



(4.36) 



S{t) = e' f" e- z F{T)F{x) T dx + e'j [ f°° ' e - x F{x)F{x) T dx 

= S(t)-F(t)F(t) T . (4.37) 

From (4.37), we obtain 

% < -e T S(t)6 + v T v-20 T S(t)B := Y A (4.38) 

and claim 

Y 2 = 0,Y 3 = => Y 4 = -ae- s \9\ 2 <0 (4.39) 

because the second and third terms in (4.38) vanish when Y 2 = and Yt, = 0. Indeed, 
Yt, = leads to {D T ®I p )v(t) = 0, which indicates that v(t) lies in jV(D t ®I p ). 
Recall that jY {D t ®I p ) = \ N ®c, c eW and v\ = p . Therefore, it follows that 
v(t) = 0, which means that the second term in J4 (4.38) is zero. Likewise, from 
(2.46), Y2 — O results in z = 0, which means z belongs to the desired set stf and thus 
y/(z) and u are zero. It follows that = from (4.11), which shows that the third 
term in I4 vanishes. 
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Because Y, = 0, / = 1,2,3,4, imply (§,z, 9) =0 and we conclude from the Nested 
Matrosov Theorem in Appendix B.5 that the origin is globally uniformly asymptot- 
ically stable. Note that 0^0 implies 0, — ► 6, i = 2,--- ,7Y. □ 



4.4 The Augmented Design 

Like the design in Section 3.3, the basic adaptive design in Section 4.2 ensures track- 
ing of reference velocity (objective Al) only in special cases, such as the agreement 
problem studied in the previous section. To guarantee objective Al when the ba- 
sic adaptive design fails, we employ the augmented design (3.68) in the update 
law (4.8). The augmented design recovers the stability result of Theorem 2.1 and 
achieves tracking of the reference velocity as well. 

Theorem 4.3. Consider the coordination laws in (3.23), (3.24) and (4.8), where v(t) 
is parameterized as (4.1) in which (j) J (t), <j) J (t), j — 1 , • • • , r are continuous and uni- 
formly bounded. With m,- defined in (3.68), and Jiff, i= 1, • • • ,N, and Xj/^, k= 1 , • • • , i 
designed as in (2.11)-(2.15) and (2.27)-(2.31), all trajectories (z(t) , £, (t) , 6 (f)) 
starting in <$ x M.P r ( N ~ l > are bounded and converge to the set 

S; = {(z,4,0)|§ =0,(D®I p )\i/(z) =0,ze ,%(D T ®I p ),t t (t) = V W} > (4-40) 

where v,(?) is defined in (4.4) and Sf is as in (2.37). □ 

The closed-loop structure of the augmented adaptive design is given in Fig. 4.2. 
Recall from Property 1.4 that x T (U ®I p )x = l^x T (L\ ®I p )x, which is nonnegative 
since G v is strongly connected and balanced. Thus, the static block L^ ym in Fig. 4.2 is 
passive and the feedforward path from i to — u is also passive. As the passivity of the 
feedback path is already established in Theorem 4.1, we conclude the closed-loop 
stability of the system in Fig. 4.2. 

Proof. Note that (3.68) can be rewritten in the compact form (3.69). To prove the 
stability of the closed-loop system described by the adaptive design (2.11), (3.55), 
(3.69) and (4.1 1), we take the same Lyapunov function as in (4.17) and from (2.40), 
(3.70), (4.16) and (3.69), compute its time derivative as 

V = - £ W t (&)- £ uJy,-(y + v) T (L v (g>I p )(y + v) < (4.41) 

which implies stability and boundedness of (z(t),^(t), 9(t)). Using Theorem B.5, 
we further conclude that £,■ — > 0, V;' € J , Ui — ► 0, V; $ J and (y + v) T (V (g) I p ) (y + 
v) — ► 0. For dynamic block Jfi, it follows from £,- — ► and (2.11) that y,- — + 0. For 
static block J^, w, — ► implies y, = hi(iij) — ► 0. Thus, y — ► 0. Recall from (1.20) 
that (y + v) r (L v (g)/p)(y + v) is zero only when (y + v) T (Ll ym )(y + v) is zero. Since 
the graph G v is strongly connected, the graph corresponding to V sym is connected. 
Therefore, (y + v) T (L v ®I p )(y + v) = implies y + v= Iat^c, where c £ M. p . We 
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Fig. 4.2 The closed-loop structure of the augmented adaptive design. The property of V in (1.20) 
renders the passivity of the top loop. The closed-loop stability follows from the interconnection of 
two passive feedforward paths and two passive feedback paths. 



conclude from v\ = and y\ —* that |y+v| — ► 0. Since y — *■ 0, it follows that v - 
which implies from (4.14) that tracking of v(t) is achieved. 
We next show u — ► 0. To this end we note that 



dfi . ^ dft 
out oqj 



ii 



(4.42) 



is continuous and uniformly bounded because u and % are continuous functions of 
the bounded signals (z(t),^(t),S(t),<P(t),<i>(t)) and because /,(•,•) is C 1 . Since 
%i — ► and <^,- is continuous and bounded, it follows from Theorem B.4 that £,- — > 0, 
which, from (2.11) and (2.12), guarantees m,- — > 0. Since |v + v| — > 0, we conclude 
from (3.69) that (D®I p ) \fr(z) -> 0. D 

The main difference of Theorem 4.3 from Theorem 4.2 is that it achieves ref- 
erence velocity tracking directly while Theorem 4.2 establishes tracking by first 
achieving parameter convergence. Parameter convergence is sufficient but not nec- 
essary for velocity tracking (compare (4.1) and (4.4)). We next show that velocity 
tracking implies parameter convergence 0, — > when the regressor <P(t) satisfies 
the PE condition in (4.23). We need the following lemma. 



Lemma 4.1. Let 



X = f(X,t), 



(4.43) 
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where XgW andf(X, t):Wx M> -> M". //a/7 trajectories X (t) satisfy f(X(t),t)-> 
and Q (t) X(t) — > 0, where Q (t) £ M" is bounded and satisfies the PE property in 
(4.23), then X(t) -> 0. D 

Proof. We rewrite (4.43) as 

X = -f2(/)X2(r) r X + £(f) (4-44) 

where £(f) := r2(?)i2(f) r X + ./'(X,r), and note that £(/) -» since Q(t) T X and 
f{X,t) both converge to zero and since £2(t) is bounded. Solving for X from the 
linear time-varying model (4.44), we obtain 

X(t) = E(t,to)X(t )+ /*S(f,T)C(T)dT (4.45) 

where S(?,fo) is the state transition matrix. Because Q(t) is PE and because £(r) — ► 
as f — > oo, it follows from standard results in adaptive control (e.g., [62, 136]) that 
X(t)-+0. □ 

We now combine Theorem 4.3 and Lemma 4. 1 to prove parameter convergence: 

Corollary 4.1. Suppose all conditions of Theorem 4.3 hold. If in addition, <£>(?) 
satisfies (4.23), then 0[ — *• 0. □ 

Proof. We establish 0; — > by using the PE property (4.23) and Lemma 4. 1 to prove 
that |v| — > implies 0, — ► 0, that is 0,- — >• 0. 
We note from Theorem 4.3 that 

v={I N ®<P T (t)®I p )8->0 (4.46) 

and that 

$, = y\,(#(f) «/;,)«* ->0 (4.47) 

since ui — > 0. Because the signal <£> r (?) is PE, it follows from Lemma 4. 1 that 0, — > 0, 
which proves the parameter convergence 0,- — » 0. □ 

Example 4.2. To illustrate the parameter convergence, we simulate the example in 
Section 3.5.1. We take 



v(t) = ([sin(f) cos(f)]<g>/ 2 ) 



1 

e 2 



(4.48) 



where 1 = [-^ 0] r and 2 = [0 ^] r . This v(f) is the same as in Section 3.4.2. 
The estimate v,-(f) in (3.64) is obtained from (4.4) with <P(t) = [sin(f) cos(f)] r and 
$ = [(9/) r (A 2 ) r ] r updated by (4.8). 

The initial conditions of x,-(0), i= 1,2,3, §i(0), and 4(0), / = 2,3 are the same 
as in Section 3.4.2. The initial estimates are set to 02(0) = [4^ — I — \ — ^-] T 

and 03(0) = [^ j i — ^ ] r such that the group exhibits the same motion as in Fig. 
3.4(b) if (2.23) is used in (4.8). 
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snapshots of the formation 



formation at ^ trajectory of agent x 

u 




trajectory of leader x 



trajectory of agent x 



position-x (m) 

Fig. 4.3 The augmented adaptive design recovers the convergence properties of the nonadaptive 
design. [13]. Reprinted with Permission. Copyright Elsevier 2009. 



When the augmented feedback (3.68) is employed in (4.8), Fig. 4.3 shows the 
snapshots of the formation. The group now exhibits a translational motion with x\ 
circling around the origin, which means that the nonadaptive results are fully recov- 
ered. In addition, because <P{t) is PE, parameter convergence is achieved as shown 
in Fig. 4.4. In this simulation, the graphs G and G 1 are chosen the same as in Section 
3.5.1. □ 



4.5 Application to Gradient Climbing in Formation 



In this section, we apply the adaptive design result to a gradient climbing problem, 
where the group leader performs extremum seeking for the field minima or maxima, 
while the other agents maintain a desired formation with respect to the leader. Keep- 
ing a group formation during the gradient climbing may be desirable for reliable 
inter-vehicle communication/sensing, drag reduction, safety in adversarial environ- 
ments, etc. 

To achieve gradient climbing in a field distribution, the leader takes a discrete- 
time, optimization based extremum seeking approach. This extremum- seeking ap- 
proach, illustrated in Fig. 4.5, generates finite-difference approximations for the gra- 
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9\ and 8\ converge to S\ 




20 25 30 35 40 

time(s) 



#2 and S'l converge to S\ 




20 25 30 35 

time(s) 



Fig. 4.4 Parameter convergence with the augmented adaptive design, ft, 1 and 9j converge to 1 = 

[—■V- 0] T while §2 and 6? converge to 6 2 = [0 ^] T ■ [13]. Reprinted with Permission. Copyright 
Elsevier 2009. 



dient and the Hessian of the field, by "dithering" sensor positions. The advantage 
of this local approximation is that only the leader needs sensing capabilities, and 
communication of sensed variables and geographic proximity of sensors are not 
necessary for generating approximate gradients. 
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Po»Etr&n* 




Fig. 4.5 Gradient climbing by extremum seeking. Arrows represent the Newton motion, while 
triangular paths are the dither motion with the samples taken at positions marked by dots. The 
dither motion has three segments: Along horizontal axis from left to right, along the diagonal from 
right to left and along vertical axis from top to bottom. The directions of these three segments are 
denoted by [1,0], [-1,1], [0,-1]. 



After the dither motion, the leader calculates a Newton direction towards the field 
extremum. Thus, the group reference velocity v{t ) is determined autonomously by 
the leader, in the form of segments \>k{t), t £ [tk,tk+i]< that are updated in every 
iteration k according to the next Newton direction. Since v(t) is not available to 
the other agents, they need to estimate this v(t) information to achieve a successful 
gradient climbing in the desired formation. 

We let the leader parameterize its reference velocity as in (4.1) and apply the 
basic adaptive design in this chapter to ensure a desired group formation. During 
the dither motion of the leader, the other agents may turn off the velocity adaptation 
design so that they do not respond to the dither motion of the leader. Even if the 
adaptation is not turned off, we show with simulation results that if the Newton 
motion lasts sufficiently long, the followers respond only to the Newton motion 
while filtering out the dither component. 



4.5.1 Reference Velocity Assignment by the Leader 



In this section, we present the extremum seeking scheme performed by the leader. 
The analysis of the motion of the group will be pursued in Section 4.5.2. The goal 
in extremum- seeking based gradient climbing is to search for and move towards the 
maximum of a field distribution with an unknown functional form. The leader has 
access only to the scalar field measurements, and constructs the approximate gradi- 
ent and Hessian information of the field by finite-difference methods to compute a 
Newton direction. It then assigns an appropriate velocity along the computed New- 
ton direction. It is important to note that this scheme locates the maxima without 
position measurements. 
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We first review basic optimization tools that are instrumental in the extremum 
seeking design. We assume that the field has a spatial distribution characterized by 
a twice continuously differentiable function F(x) : R 2 — > R that has a unique maxi- 
mum at x = x*. Note that we restrict our attention to fields only in R 2 , however, the 
results can be extended to R 3 as well by employing appropriate finite-difference ap- 
proximations. Also note that if the function F(x) has multiple maxima, then the re- 
sults can be modified to prove regional convergence to the local maximum. Because 
only field measurements are available to the leader, we approximate the gradient 
and Hessian of F{x) by one-sided finite-difference gradient, G k , 

_ , .. r, F (xt + htei) — F(xk) 

VF{x k ) « G k [i\ := ^ Y 1 ^ (4.49) 

h k 



and Hessian, Hi 



V 2 F(x k ) « H k [i,j] := -^ \F(x k )+F(x k + h k ej + h k ej) 
- F(x k + h k ei) -F(x k + h k ef) 



(4.50) 



where h k denotes the finite-difference "dither" size, and e, is the i th unit vector. 
For an easier implementation, steepest descent may be preferable over Newton's 
Method; however, it is slower and does not provide a concrete convergence proof 
with nonvanishing step-size. We denote by B(x, a) the ball of radius a centered at x, 
i.e., B(jc, a) := {x\ \x — x\ < a}. The lemma below states that for sufficiently small 
dither size h k , and for small initial error \xq —x*\, finite-difference based Newton's 
Method locally converges to an (?(/z)-neighborhood of x* . The proof follows from 
standard arguments in unconstrained optimization theory, and is given in Appendix 
A.3. 

Lemma 4.2. Let F(x) : R —> R be twice continuously differentiable in an open 
convex set 3> £ R . Assume there exists a unique x* £ R , r > and /3 > such that 
B(x*,r) g @, VF(x*) = 0, V 2 F{x*y l exists with \\V 2 F (jc* ) ^ x 1 1 < P> and V ^W 
and V F(x) are Lipschitz continuous. Then there exist e > and h > 0, such that 
for all initial conditions xq G B(x* , e), and dither size h k <h the sequence {jCjt}j^Q 
generated by 

x k+1 =x k +H- l G k , t=0,l,--. (4.51) 

where G k and H k are as in (4.49)-(4.50) converges to an 0(h) neighborhood of x* 
q-linearly. □ 

We next introduce the Newton's Method-based gradient climbing scheme that 
the leader implements to locate the maximum of a field. We consider the agent 
model in (2.16) and the control design (2.17). We assume that the leader, say agent 
1, does not receive external feedback m, from other members of the group, hence 
mi ee 0. Recall that (2.16) and (2.17) can be transformed to (2.19) and (2.20). We 
then assume %\ (0) = 0, which implies from (2.19) and (2.20) that 



4.5 Application to Gradient Climbing in Formation 



85 



x l =v(t). 



(4.52) 



Note that if <jji(0) ^ 0, we can apply the velocity input v(t) = —£>\ + v(t) to the 
leader, and recover (4.52). 

We use Newton's Method to determine the next position for the leader, and set 
the reference velocity v{t) to steer the leader to that position. As illustrated in Fig. 
4.5, in the fcth extremum-seeking iteration, the leader first moves in [1, 0], [—1, 1], 
and [0, —1] directions rapidly to take samples of the field F(x) and computes the 
approximate gradient G k and the Hessian H k as in (4.49)-(4.50), and then moves in 
the approximate Newton direction l k = HZ G k , and arrives at x k+ \ = x k + l k . 

To prepare for an adaptive reference velocity estimation by the followers, we 
parameterize v(t) in a form similar to (4.1). For each motion segment we let the 
reference velocity have a fixed sinusoidal amplitude profile, with endpoints at zero, 
and change its direction between successive segments. We denote by vr; ,-i and vn the 
dither velocity in the [i,j] direction, where [i,j] e {[1,0], [—1, 1], [0,-1]}, and the 
Newton velocity in l k direction, respectively. Let t d be the duration of each dither 
motion segment, and T be that of the Newton motion. Therefore one iteration of 
the extremum seeking scheme takes A :=3t c / + T seconds. During each extremum 
seeking iteration, the leader switches its velocity as 



x\ 



v(0 



V[ L o](f)> tftk<t<tk + td, 

V[_ M ](f), ift k + t d <t<t k + 2t d , 

V[o,-i]W> Ht k + 2t d <t<t k + 3t d , 

v N (t), i£tk + 3t d <t<tk+i, 



where 

t k :=kA, k = 0,1,2, 

and V[i o]> v [-i l]> v [0.-i] an d V N are defined as: 



V [l,0]( ? 


2h k 

td 


r 




( 


l-cos(~(*-4))) 


[-!,!](' 


2h k 

td 


r 
i 


(l-cos(—(t-tk-td))) 


[0,-1] (t 


2h k 

td 


" 


i 


( 2k 

[l-cos{ — (t-t k -2t d )) 

v td 



v N (t) := ^(l-cos(^(f-4-3^))). 



(4.53) 

(4.54) 

(4.55) 
(4.56) 
(4.57) 
(4.58) 



The reference velocity v{t) in (4.53) and its derivative v(t) are continuous, and 
(v(t),v(t))\ te { tk+rad! t k+1 } = (0,0), n — 0,1,2, 3. Note that other continuous velocity 
profiles that vanish at t £ {t k +nt d , t k +i}, n = 0, 1,2, 3, along with their deriva- 
tives, are also applicable. The velocities in (4.55)-(4.58), when switched according 
to (4.53), achieve one iteration of extremum-seeing motion by driving the leader 
first to the appropriate "dither" positions and then to the next "Newton" position 



4 Adaptive Design for Reference Velocity Recovery: Parameterization Approach 



x k +i. Theorem 4.4 below proves that the extremum seeking scheme converges to 
an 0(h) neighborhood of the maximum x*, when h k < h is as in Lemma 4.2, and 
|x(0) — x*\ is sufficiently small. 

Theorem 4.4. Let the field distribution F(x) be twice continuously dijferentiable 
with a unique maximum at position x = x* EM. . Suppose the assumptions in Lemma 
4.2 hold and h be as defined therein. Then the Newton-based extremum seeking 
scheme applied to the vehicle model in (4.53) with velocity profiles (4.55)-(4.58) 
drives the vehicle to the 0(h) neighborhood of x* , provided that hk <h and \x(0) — 
x* | is sufficiently small. □ 

Proof. We show that the reference velocity profiles given in (4.53) first drive the 
leader in the appropriate dither directions, and then along the Newton direction. 
Consider vn i which drives the leader in horizontal position, i.e., along the vector 

[1,0]. At time t k , let the position of the leader be x\(tk) = [x\(tt) ,x\(tk)Y G K 2 - Then 
at time tf, + t c i/2 its position is: 

: X X (t k ) ' 

xi(t k )- 

x\ (tk) - 

■ X\ (t k ) - 



*!&+!) 



t k +t d /2 



tk 

2hk 

tcl 

2h 
td 

h 



[1,0] (0* 

tk+tdft 



2n 
(l-cos( — (t-t k )))dt 
td 



-wz sin ( — (t~t k )) 
2n td 

x l (tk) + h k 
x 2 (tk) 



tk+td/2 
tk 



(4.59) 



Likewise, 



X\(tk+td) 



x\(t k + t d /2) 
x\(t k + t d /2)+h k 



tk+tj 

tk+td/2 

1 







V[i,o]M* 

x l (t k )+2h k 
x 2 (t k ) 



(4.60) 



Similar calculations show that vr_i n and V[o.-i] achieve the desired dither motions 



as well. Note that after the third dither motion v 



[o,-i] 



the leader will be back at 



position x\(t k + 3td) =x\(t k ). Then, applying the "Newton" velocity vn after this 
point for T seconds drives the leader to 

1 ftk+itd+T 2K 

xi(t k +A) =x l (t k ) + l k - / (l-cos( — (t-t k -3t d )))dt 

= xi(t k )+l k =xi(t k+ i). (4.61) 

Therefore, by switching the velocities as in (4.53) the leader visits all dither posi- 
tions and moves to the next Newton position. The convergence result follows from 
Lemma 4.2. □ 



4.5 Application to Gradient Climbing in Formation 

4.5.2 Gradient Climbing in Formation 
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We have shown that using the switching strategy in (4.53) with the reference velocity 
v(f) parameterized as in (4.55)-(4.58), the leader locates the extrema of the field. 
We next investigate how to design the motion of the other agents to achieve gradient 
climbing in a desired formation. 

As discussed in Section 2.6, we may pursue the position-based or distance-based 
formation control formulation. In either formulation, we note that the other agents 
do not have the knowledge of the reference velocity v(t) which changes after each 
iteration of extremum seeking. Therefore, the adaptive designs in Chapter 3 and 
this chapter can be applied to estimate the v(f ) information. Since (4.55)-(4.58) are 
already parameterized as a product of a vector and a time-varying basis function, we 
will take the parameterization approach in this chapter. Then the dynamics of agent 
i, i = 2, • • • ,N are given by the basic adaptive design (3.24), (2.1 1), (4.4), and (4.8), 
where we assume that m, has already been designed according to the position-based 
or distance-based formation control formulation. Following (4.53), we obtain the 
basis function <P(t) £ R and the constant parameter 8 £ M 2 in (4.1) as 



*(0 



7l-cos(f(f-f,))), 
l-cos(^(t-t k -t d )) 



if t k <t<t k + t d , 
ift k + t d <t<t k + 2t d , 



(4.62) 



and 



(4.63) 



1 - cos(^ -t k - 2t d ))\ , if h + 2t d < t < t k + 3t d , 
1 - cos(^(t-t k - 3t d ))) , ift k + 3t d <t< t k +i, 



, ift k <t<t k + t d , 

if t k + t d <t<t k + 2t d , 

ift k + 2t d <t<t k + 3td, 
iit k + 3t d <t<t k+ i. 



In each motion segment, agent i employs the basic adaptive design in Section 4.2 to 
estimate by 0, and reconstruct the desired formation. 

If t d and T are sufficiently large, the result in Theorem 4. 1 implies that the desired 
formation is ensured during each motion segment. This means that the agents will 
follow both the dither and the Newton motions of the leader. However, if only the 
leader has the sensing ability, it may be desired that the other agents respond only 
to the Newton motion. This can be achieved by simply turning off the adaptation 
during the dither motion periods. Even if adaptation is not turned off, the other 
agents detect only the Newton motion if T is sufficiently larger than t d . To see this, 
we note from (4.53) that the average velocity of v(t) within one extremum seeking 
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agent 3 




agent 4 
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Fig. 4.6 The desired formation of four agents in the gradient climbing. The number indicates the 
desired length of each link. 



iteration is given by 



Vnv := 



t k +A 1 

v{t)dt = -rk- 



(4.64) 



Then if T is sufficiently large, A sa T and thus v a 



4k, which is indeed the av- 



erage of Vff{t) in (4.58) in one Newton motion period. This implies that for suffi- 
ciently large T, the dither motion is averaged out. Thus, we can choose a sufficiently 
large T to ensure that the other agents follow only the Newton motion within each 
extremum- seeking iteration. In fact, for large T, one can further reveal a time-scale 
separation behavior in the group motion and show that the convergence to the de- 
sired formation is achieved in the fast time scale, while the Newton motion is per- 
formed in the slow time-scale. We refer interested readers to [18] for further details. 



4.5.3 Simulation Results 



We simulate the gradient climbing of four agents modeled by (2.16) with m, = 
1. We consider the distance-based formation control in Section 2.7.1. The desired 
formation is a rhombus formation shown in Fig. 4.6. Note from Example 2.5 that 
to ensure an unambiguous desired formation of four agents, we need to specify 
the desired relative distances between every two agents. According to Fig. 4.6, we 
define z\ = X\ — X2 and set its desired distance as d\ = y/3. For the other Zk's, k = 
2,- • • ,6, their desired distances arerfj = 1- Given J^'s, the nonlinearity \jfk(Zk) can be 
designed according to (2.83)-(2.87). For the simulations we take <7i(s) — ln(i/v3) 
and (Tk{s) = ln(s), k — 2,- ■ ■ ,6. 
We let the field distribution be 



F(x,y) 



_ -0.1? 0Lv (l.lx-5) 2 -0.2e 01v '(0.8y-4) 2 
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Fig. 4.7 Gradient climbing by Newton-based extremum seeking with T — 18.5 sec, tj = 0.5 sec, 
and hi, = 0.05. Solid line represents the leader's trajectory, while dashed line, dash-dot line, and 
dots are the followers'. After an initial transient agents follow the leader's Newton motion in a 
rhombus formation, and average out the fast dither perturbations. 



which has a global maximum at x — [4.55, 5} T . We fix A — 20 sec and % = 0.05, 
and pick t c ( = 0.5 sec and T — 18.5 sec for the first simulation. We run the system 
(4.52) and (3.24), where the leader determines its velocity by extremum seeking as 
in (4.53) and (4.55)-(4.58) and the other agents estimate v(t) by (4.4) with <P{t) in 
(4.62). Fig. 4.7 shows that after an initial transient, agents follow the leader's New- 
ton motion in a rhombus formation, and average out the fast dither perturbations, 
while the leader locates the maxima of the field. In the second simulation, we per- 
form the dither motion at a slower speed with t c i = 4 sec, 7 = 8 sec. In this case, 
the agents in Fig. 4.8 fail to average out the dither motion, and follow a jittering 
trajectory. 



4.6 Summary 



In Chapters 3 and 4, we studied a group coordination problem where the reference 
velocity is available to only one agent while the others estimate this information 
with adaptive designs. We presented two approaches to the adaptive designs. The 
first approach assumes a constant or a periodic reference velocity while the sec- 
ond approach parameterizes the reference velocity as linear combinations of known 
time- varying basis functions with unknown coefficients. For each approach, we first 
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Fig. 4.8 Gradient climbing by Newton-based extremum seeking with T — 8 sec, tj = 4 sec, and 
h/, = 0.05. Solid line represents the leader's trajectory, while dashed line, dash-dot line, and dots 
are the followers'. The agents fail to average out the dither motion, and follow a jittering trajectory. 



proposed a basic adaptive design that guarantees objective A2. We showed that 
tracking of the reference velocity is recovered for some special cases including the 
agreement problem. We presented an example which shows that the estimates of 
the reference velocity may fail to converge to a time-varying reference velocity. For 
each approach, we then proposed an augmented adaptive redesign that employs rela- 
tive velocity feedback in addition to relative position feedback and achieves tracking 
of the reference velocity. 

We next applied the basic adaptive design in this chapter to an extremum seeking 
example, where the leader autonomously determines the Newton direction based 
on samples of a field distribution and parameterizes the group reference velocity 
according to the Newton direction. The other agents then estimate the reference 
velocity using the parameterization approach. In the simulation, we showed that if 
the Newton motion lasts sufficiently long within each extremum seeking period, the 
desired formation is reconstructed during gradient climbing. 



4.7 Notes 



• The extremum seeking results in this chapter are based on [18]. 
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• The extremum seeking approach in this chapter relies on nonlinear optimization 
techniques to estimate the gradient in discrete time. An alternative approach in ex- 
tremum seeking is to probe the system with sinusoidal inputs, and to make an online 
estimation of the gradient of the output relative to these inputs [151, 7]. 

• To enhance robustness to noise and input disturbance, existing modifications of 
adaptive design, such as cr-leakage modification [62], can be applied to the adaptive 
designs in Chapter 3 and this chapter. 



Chapter 5 

Attitude Coordination Without Inertial Frame 

Information 



5.1 Introduction 

The previous chapters modeled the agents as point robots and considered only po- 
sition control. However, in numerous applications, the attitude of the agents plays 
an important role, which means that the agents must be modeled as rigid bodies. In 
this chapter, we study an attitude agreement problem where each agent has access 
only to the relative attitude with respect to its neighboring agents and to its angular 
velocity in its own body frame. An important application of attitude agreement is 
space interferometry, where spacecraft need to maintain the same attitude. It is im- 
portant to note that we do not rely on inertial attitude measurements, which would 
require star tracker sensors that in practice are limited by their low-bandwidth. In 
contrast, relative attitude can be obtained by visual means and the angular velocity 
can be measured with a gyro. 

To develop decentralized controllers for group alignment, we extend the passivity- 
based framework in Section 2.3 from the point-based robot formation control to at- 
titude control problem. Because the inertial frame is not available to the agents, we 
represent the relative attitude in local body frames. The use of local body frames 
distinguishes the orientation coordination problem from the framework in Section 
2.2, where the relative position is represented in the inertial frame. 

We also draw on the earlier results on single rigid body attitude control [144], 
which exploit inherent passivity properties of rigid body dynamics and quaternion 
kinematics. In particular, we present a quaternion-based decentralized controller that 
locally asymptotically stabilizes the desired equilibria in which all the rigid bodies 
possess the same attitude, and rotate at the reference angular velocity. The resulting 
closed-loop structure is similar to the structure in Fig. 2.2. We further consider a sit- 
uation similar to Chapters 3 and 4, where the reference angular velocity is available 
only to the leader. Following similar design techniques in Chapter 4, we develop a 
decentralized adaptive controller that estimates the reference angular velocity and 
recovers the convergence results of the nonadaptive design. Although the presenta- 
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tion of the designs is based on the unit quaternion, we show that the same scheme is 
applicable to other parameterizations of SO(3). 



5.2 Kinematic Equation of Attitude Error 

We consider a group of rigid bodies i = 1, • • • ,N, whose attitudes are represented 
by the 3x3 orthonormal attitude matrix Rj ESO(3) 1 . The information flows of rel- 
ative attitude measurements are represented by the undirected connected graph G. 
Similarly to Chapter 2, we assign an orientation to the graph to simplify the analysis. 
If the fth and jth agents are connected by link k, we define the relative attitude of 
link k as 

R k ■- RfRj, if k G %+ and k G ££j (5.1) 

and the relative angular velocity of link k as 

& k := l (Oj, if k G %+ and k G ££j (5.2) 

where ££^ (Jzf - ) denotes the set of links for which agent (' is the positive (negative) 
end. This means that the relative angular velocity (b k is represented in the frame 
of the positive end of link k. To simplify the notation, we introduce the 3JV x 3M 
Rotational Incidence Matrix D, which consists of the 3x3 sub-blocks: 

-h if k G ^ + 

(R k ) T if JfcG_S?r (5.3) 

O3 otherwise, 



and note from (5.1) and (5.2) that 



W = D T (Q B (5.4) 



in which &= [(a> l ) T , ■■■, (d/)T and (0 B = [^(OiY, ■■■, ( N co N ) T ] T . 

The evolution of R k is obtained via direct differentiation with respect to the iner- 
tial frame [146]: 

— - ffl*#. (5.5) 

at 

, k = 1 , • • • , £, be the unit quaternion representation parameterizing 



Let q k 



4 

J 



R k , where q 1 ^ and q k are the scalar and the vector parts, respectively. Then R k is 
related to q k through the Rodriguez formula: 

R k = k+2(q}) 2 + 2q k Q q}, (5.6) 

and the kinematic equations of q k are obtained from (5.5) and (5.6) as: 



1 For a brief introduction of attitude matrix and its representations, we refer to Appendix B. 12. 
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(q k = -\{& k ) T q k v 

Jk '■ \ -k 1 k ~k , l^i k ( 5 - 7 ) 

[<K = ^q k Q (O k +^CO k q k . 

The following passivity property of the kinematics of unit quaternion, established 
in [84], is essential in the proof of Theorem 5.1 in Section 5.3. 



Lemma 5.1 (Passivity of unit quaternion). 

The unit quaternion kinematics (5. 7) is passive from a) to q v with the storage 



functionV={q k -l) 2 + \q k \ 2 . O 



We note that if R k equals 73, which means that the agents connected by link k have 
the same attitude, then q k = [0, 0, 0] r and q\ = ± v / l-\\q k \\ 2 = ±1. It follows that 
q k Q = ±1 correspond to the same attitude in SO(3). Without loss of generality, we 
choose to stabilize the q^ = + 1 in the rest of this chapter. Furthermore, we assume 
that initially all the agents choose the quaternions with positive scalar parts. 

When the columns of D are linearly dependent; that is, when the graph contains 
cycles, the unit quaternions q k are also dependent. To see this, suppose that the graph 
contains i — 1, ■ • • ,c cycles, each consisting of C, links. For each cycle i, label the 
consecutive links by k] , • • • , k t ' and assign the directions of the links such that the 
positive end of each link is the negative end of the next link in the sequence (see 
Fig. 2.1 1). Then, from the definition of relative attitude in (5.1), it follows that 

Q . 

]}#' =h, i=l,-,c (5.8) 

7=1 

or in terms of quaternions, 

q k - oq k f o---oq k f' = (±1000) T , /=l,---,c (5.9) 

where q k is the quaternion parametrization of R k of link k and o denotes the quater- 
nion multiplication 2 . 



5.3 Passivity-based Group Attitude Agreement 

The dynamics of the attitude of the z'th agent is given by the Euler equation: 

' Ji ' cot + ' coi x ' Ji ' cot = *% (5.10) 

where ' #; is the inertia matrix and 't, is the control input in the ith frame. The 
objective is to develop a decentralized coordination law for each agent that depends 

2 Given two unit quaternions p T = [po,pl] and q T — [qo,q^], where po, qo are scalars and p v , q v 

are 3x1 vectors, the quaternion product is defined as p o a = ( ^ tvvv 

V Pvlv + Poqv + qoPv 
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only on its angular velocity and on its relative attitude with respect to its neighbors 
to guarantee the following behaviors: 

Bl) Each agent achieves the same attitude as its neighbors in the limit; that is 
]imR-Rj=h, i,j= 1, ••-,#. 

B2) The angular velocity of each agent converges to a reference angular velocity 
(O d (t); that is lim I'flfc — CO d (t)\ =0, i=l,---N. 

We assume that (0 (t) and co d (t) are continuous and uniformly bounded. In ob- 
jective B2, the agents follow a> d (t) in their individual body frames. When objective 
B 1 is achieved, B2 means that the agents rotate synchronously. 

To achieve objectives Bl and B2, we follow the design procedure in Section 2.3. 
We first design an internal feedback loop ' T, for each agent i = 1 , • • • ,N that renders 
its dynamics passive from an external input signal m, left to be designed, to the 
angular velocity error 

AcD i := i co t -(D d (t). (5.11) 



Indeed, the controller 



6) d + a> d x '' Ji ' (Oj -fiA(0i + Ui, (5.12) 



where /, is a positive constant, yields the error dynamics system: 

M: , c fiAG) i + Aa) i x'J r i i (o i = -fAco i + u i , (5.13) 

which is strictly passive from m, to Aft), with the storage function V/(Aft),) = 
^(Aa>j) T ' ^(AcOi. The stability analysis below relies on this passivity property, and 
not on the particular form of the control law (5.12). To achieve further design flexi- 
bility, other controllers that achieve passivity from Uj to A (Qj may be employed. 
Next, we design the external feedback w, of the form: 

n= £ q\- L flf. (5.14) 

To synthesize this external feedback m,, each agent i obtains its neighbors' relative 
attitudes with respect to its own frame, parameterizes them by unit quaternions, and 
then adds up the vector parts of the unit quaternions. Thus, the control law (5.12) 
with the external input (5.14) is implementable in a decentralized fashion. 

To analyze the stability of the closed-loop system, we denote by Jk, k — !,-■■,£, 
the kinematic equation (5.7) for the Icth link with input G) k and output q\„ and let 

jr = diag{/i,-,/iv}®/3 < /=<fiag{ 1 < /i.-. JV «A} ( 5 - 15 ) 

g m = [A(£>l>---,A(ol] T u = [u T ll --- 1 u T N ] T t b = [ 1 tL---, n tJ / ] t (5.16) 

qo = [ql--Ao] T qv = [(qlf,-,(4f} T - (5-17) 

Using this notation, and rewriting (5.14) as 
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Fig. 5.1 Interconnection of the quaternion kinematics (5.7) and the error dynamics (5.13). Jk de- 
notes the quaternion kinematics (5.7) and J0f is as (5.13). 



u = (D®Ij,)q v 



(5.18) 



where D is as in (1.21), we obtain the closed-loop block diagram in Fig. 5.1. 

Theorem 5. 1 below makes use of the passivity properties of the feedforward and 
the feedback paths in Fig. 5.1, and analyzes stability properties of the origin in 
the {S'm, q v }-space. Before proceeding with this analysis, however, we point to 
an obstacle to the global convergence to the origin which arises when the graph 
contains cycles. To see this note that the set of equilibria for the closed-loop system 
in Fig. 5.1 is given by: 

= {(£ m ,qo,q v )\#a> = O, constraint (5.9) and (D®I 3 )q v = 0}, (5.19) 

where $ m = implies '<», = C0 d , which means that each agent achieves the refer- 
ence angular velocity. If the graph is acyclic, that is, if the columns of D are linearly 
independent, then the third constraint (D0l 3 )q v — implies q v = 0, which is the 
desired relative attitude where all agents are aligned. However, when the graph con- 
tains cycles, the null space of (Z)®^) is nontrivial, and the set in (5.19) contains 
additional equilibria. In Theorem 5.1 below we prove that all trajectories converge 
to the equilibrium set ff. 

Theorem 5.1. For the agents i — 1,- ■■ ,N defined by the dynamics (5.10), the control 
laws (5.12) and (5.18) guarantee that the signals {S m ^ qo, q v } converge to the set 
of equilibria in (5.19) and the equilibrium {<§m, qo, qv} = {0, + l|»,0} is stable. If 
D is full column rank, that is, if the graph is acyclic, then q r — * and S m — ► as 

t — >°o. □ 

Proof. We first make use of Lemma 5.1 and prove that the feedforward path in Fig. 
5.1 is passive from co B to — u. To show this, we take the storage function 



V u = (qo-le) T (qQ-le) + qlqv, 

and note that its derivative yields 



(5.20) 
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% = <a V (5.21) 

We further rewrite (5.21) from (5.4) as 

V u = (co B ) T (Dq v ). (5.22) 

It then follows from Lemma 5.2 below that 

V u = (co B ) T (-(D®I 3 )q v ) = (co B ) T (-u), (5.23) 

which proves passivity from co B to — u. 

Having proven passivity of the feedforward path using the storage function V„, 
we now take the Lyapunov function 

V = V u +\#lSS a , (5.24) 



2 
and note that its derivative along (5.7), (5.10), (5.12) and (5.18), satisfies 

V = -(a B ) T u + glu-gl&S a (5.25) 

= -{l N (g>co d ) T u-g^g (0 . (5.26) 

Recalling that u = (D®h)q v from (5.18) and that (D®h) T (\ N <g> co d ) = 0, we 
obtain from (5.26) that 

V = -&l&S m < 0, (5.27) 

which proves that the equilibrium {&&, qo, qv} = {0, +1^,0} is stable. It then fol- 
lows that all the signals (gio,u,qo,q v ) are bounded. It then follows from Theorem 
B.5 that g a — ► 0, that is |'ffl,- - co d (t )| — > 0. Next, we show that u — > 0. To this end, we 
note that A (Qj is continuous and uniformly bounded from (5.13) because the signals 
{u,j, Ad>i, l flu ^ i fo d } are bounded and continuous. Since Ao>i — * and Acdj is 
continuous and bounded, it follows from Theorem B.4 that A ci), — > 0, which implies 
from (5.13) that m,- -> 0. 

Finally, we note from (5.18) that u — > means that q v converges to the null space 
JK (D ® 1$) . In particular, if D is full column rank, which means there are no cycles 
in the graph, then j^(D<S>h) = and, thus, q v — > 0. D 

Lemma 5.2 below was used in the proof for Theorem 5. 1 to establish the passivity 
of the feedforward path in Fig. 5.1. 

Lemma 5.2 (Rotation invariance). 

The vector part q v of the unit quaternion satisfies: 

{D®h)q v = -Dq v (5.28) 

where D is as in (1.21) and D is as in (5.3). □ 

Proof. Define Dj = [di\,- ■ ■ ,du], where d^ is as in (5.3) and note that 



5.4 Other Representations of 50(3) 99 

I 
- D t q v = E d ik<l k v 

k=\ 

= E q' v + E {R p ) T {-<$)■ (5-29) 

leSff P eS?r 
Substituting (5.6) into (5.29) and using (1.21), we further obtain 

-A«v= E i- E «? 

= {D w ®I 3 )q v , (5.30) 

where Z), . represents the z'th row of Z), and note that (5.30) is equivalent to (5.28). 



5.4 Other Representations of SO(3) 

In this section, we show that the unit quaternion can be replaced by any other rep- 
resentation of SO(3), if this representation satisfies the passivity property (Lemma 
5.1) and the rotation invariance property (Lemma 5.2). Among the three-parameter 
representations that satisfy these two properties are the vector quaternion, Gibb's 
vector, modified Gibb's vector and the unit equivalent axis/angle. 

To present a unified storage function for these representations, we let h be a unit 
vector along the equivalent axis of R k and let G k be the equivalent angle. We then 
write the parameter vector sr in the form: 

s k = y(G k )h k , (5.31) 

where y(-) is a first-third quadrant odd nonlinearity and its specific form depends on 
the representation as follows: 

Qk 

Yvq{d ) = sin( — ) (vector quaternion) (5.32) 



2 

e k 



Yg{6 k ) = tan(y) (Gibb's vector) (5.33) 

gk 

YmG(d k ) = tan( — ) (modified Gibb's vector) (5.34) 

Ye(O k ) = k (equivalent axis/angle). (5.35) 

The following lemma unifies the earlier passivity results in [84] and [139]: 

Lemma 5.3. The kinematics of three-parameter parametrization s in (5.31), where 
y(-) is an odd first-third quadrant nonlinearity, is passive from CO to s . □ 

Proof. The kinematics of 9 k and h k are given by [146]: 
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9 k = {<b k ) T h k (5.36) 

h k = --{h k + cot — (h k f)6) k . (5.37) 

A valid storage function for all the representations defined in (5.31) is chosen as 

,6* 
V p = / y (6)d6. (5.38) 

Jo 

Since y(-) is a first-third quadrant odd function, V p is an even and positive definite 
function. The derivative of V p is 

v P = y{e k )e 

= (s k ) T a> k (5.39) 

which proves the passivity from cb k to s k . □ 
We now show that s k satisfies "rotation invariance", that is, 

(D®h)s=-Ds. (5.40) 

where ^[(^V-.^T] 7 . 

Lemma 5.4. The three-parameter parametrization s satisfies the property of rota- 
tion invariance (5.40). □ 

Proof. The property of rotation invariance originates from the fact that the equiv- 
alent axis of an attitude matrix is an eigenvector of that matrix associated with the 
eigenvalue 1, that is frh = h k . To prove the identity (5.40), we follow a procedure 
similar to that in (5.29)-(5.30), and obtain 

-DiS={Di,®h) Sl (5.41) 

which is equivalent to (5.40). □ 

Since s k satisfies the properties of passivity and rotation invariance, the frame- 
work in Theorem 5.1 follows directly. Indeed, the internal feedback law (5.12) re- 
mains the same while the external control law m, is now defined as 

«,= £ j*- £ sP, (5.42) 

which can be synthesized similarly as for the unit quaternion. 

However, because sr is a three-parameter representation and is only locally one- 
to-one and onto mappings of attitude matrix, it is not a globally nonsingular three- 
parameter representation of 50(3). In other words, the Jacobian that maps differen- 
tial changes in sr to differential changes in the attitude matrix is singular for some 
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orientations. For example, the vector quaternion (h k sin(^-)) and the Gibb's vector 

{h k tan{^-)) are singular when 6 k = %. This singularity is not a physical one in the 
sense that relative attitude can not move in some direction. It is a mathematical de- 
ficiency due to the local nature of three-parameter representation. In that singular 
situation, s k is not a valid parametrization and other representation that is nonsingu- 
lar at that point might be used. 



5.5 Attitude Coordination in the Plane 

We now reduce the controller from the three dimensional space to the plane. This 
means that the first two entries of 'a>, are zero and, thus, (5.10) is 



fi'COi 



(5.43) 



where J?i e R is the inertia of agent i in the plane, ' CO, • € K is the angular velocity and 
l %i 6 K is the control torque of agent i. Likewise, the D matrix in (5.3) now becomes 
— D, We note that for each agent the rotation axis is the z-axis, which implies that 



e k 



e 



<2o = cos — and q k = [0 sin — ] T , where 9 k € [— it, n) is the relative attitude of 
link k connecting the ith and y'th agents. We thus obtain from (5.14) the control law 



al ap 

: E sin (y)~ E sin (Y^- 

l&^+ pe%r 



(5.44) 



To see the equilibria that result from this controller, consider the special case of a 

ring interconnection in Fig. 2.1 1. In this case, the incidence matrix D has the cyclic 

structure 

I -\ ••• 1 \ 

1 1 -•• 

1 -1 ••• 



D 



(5.45) 



■■■ 1 -1 

V o o o ••• i -1/ 

whose null space is spanned by 1^. Then from (5.9) and by letting m in (5.14) to be 
zero, we obtain the equilibrium conditions 



and 



0' 0J 

sin — = sin — 

2 2 


i, j =h---, N 


N 




E 9' = c ■ 2% c 


= 0,---,N-l 


i=i 





(5.46) 



(5.47) 
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agent lQ 

I 



agent 2 



120° ! 120° 
/ 120 A 



) agent 3 



Fig. 5.2 One of the equilibria configurations for a ring connection of three agents. Arrows denote 
the headings of the agents. 



In addition to the desired equilibrium where 6' =0, i = I,- ■ ■ ,N, (5.46)-(5.47) pos- 
sess other solutions, including 



Q' = — i=l,---,N 



(5.48) 



which is depicted in Fig. 5.2 for the case of N = 3 agents. 

For N = 3, however, a study of the Jacobian linearization shows that the undesired 
equilibria are unstable. This is similar to the formation control example in Example 
2.4, where the undesired collinear equilibria are unstable. We thus conclude generic 
convergence to the desired equilibrium from all initial conditions except for those 
that lie on the stable manifolds of the unstable equilibria. 

For other representations s k = y(9 k )h k defined in Section 5.4, the controller u.j in 
(5.44) takes the form 

(5.49) 



M\lT 



u i =D i [y(e 1 ),-,Y(0 M )] 



and the equilibria are now described by 



7(0') = 7(0 y ), iJ=l,-,N 



(5.50) 



and (5.47). Further investigation shows that (5.47) and (5.50) always possess the 
equilibrium (5.48) as well as the desired one. 
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5.6 Adaptive Design for Reference Angular Velocity Recovery 

The design in Section 5.3 assumes that the reference angular velocity (O d {t) is avail- 
able to each agent. We now consider the situation where only the leader, say, the 
first agent i = 1, possesses the (O d {t) information. Since the structure in Fig. 5.1 is 
similar to that in Fig. 2.2, we follow procedures similar to those in Chapters 3 and 
4 to develop an adaptive design with which the remaining agents estimate (0 (t). 
For conciseness, we only consider the approach in Chapter 4. We then assume that 
ft) (?) € R 3 is parameterized as 

cA0 = I>W (5.51) 

7=1 

where </»'(?) are scalar basis functions available to each agent and /3 ; € R 3 are col- 
umn vectors available only to the leader i = 1 . 

The other agents estimate the unknown /3' by jS-', and construct '(of(t) from 

'<(») = L^COA^W'f®^ i = 2,-,N (5.52) 

wher e ^(O:=[0 1 (O,-,rW] r andA:=[(A 1 ) r ,-,(A'")T- 

The adaptive design employs the feedback law (5.14) in Section 5.3, and modifies 

(5.12) as 

1 ti = ' J\ G> d + (O d x ' Ji 1 0)i - fiAcOi + M, (5.53) 

't, = ! >i >(o? + tea? x i / i '©,■ - fMi + ih, i = 2, — ,N (5.54) 

where 'cof is now obtained from (5.52) and 

A(Qj := ' (Qj - > a>f . 

The update law for the parameter j6, is 

/3,.=A,(<£M<8>/3H (5.55) 

in which A, = Af > is the adaptive gain matrix and m, is as in (5.14). With this 
update law, we recover the convergence result proven for the nonadaptive design in 
Section 5.3. 

Theorem 5.2. Consider the agents i — 1, • • • ,N defined by the dynamics (5.10) and 
suppose the reference angular velocity (0 it) is available only to agent 1. The con- 
trol laws (5.53), (5.54) and (5.18) together with the update law (5.55) guarantee 
that the signals {qo^qyjAcOi^AcOi, pi}, i = 2,- •• ,N are bounded. Furthermore, A(D\ 
and Aa>i, i = 2, • • • ,N converge to zero and q v converges to the equilibria where 
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[D®lT,)q v = subject to the constraint in (5.9). In particular, if q v — > 0, then 

\ i (o i -eo d \^0. □ 

The proof for this theorem is given in Appendix A.4. Theorem 5.2 proves that 
when the attitudes of the rigid bodies reach agreement, that is, q v — ► 0, 'ft),- converges 
to the reference angular velocity co d (t). Such a case occurs when D is full column 
rank, which means that the graph is acyclic. If <P(t) satisfies the same PE condition 
as in (4.23), we can further show the parameter convergence j3, — > /3 from Lemma 
4.1. 



5.7 Simulation Results 

In this section, we present an example of three agents, where each of them is a 
neighbor of the other two. We take the moment of inertia in the body frame of each 
agent to be 

"10 0" 

0.63 

0.85 



1,2,3. (5.56) 



The first simulation shows the convergence to the desired equilibrium and to the 
reference angular velocity. The second simulation illustrates the adaptive scheme. 



5.7.1 Nonadaptive Design 

In this simulation, we show that three agents converge to the same attitude and the 
same reference angular velocity by only relative attitude information. The relative 
orientations R l (0), R 2 (0) and R 3 (0) are parameterized by the initial conditions: 

^(0) = [0.9289 0.0559 0.3652 -0.0260] r 

q 2 (0) = [0.8699 0.0165 0.4917 0.0353] r 

q 3 (0) = [0.6285 -0.0896 -0.7720 -0.0316] 7 . 

The desired angular velocity co d is chosen as [0.0429 0.0059 0.1413] r . The initial 
angular velocity in each agent's frame is zero. Fig. 5.3 shows that the desired relative 
attitude is indeed achieved with the design (5.12) and (5.14) since the scalar parts 
of a*, k= 1,2,3, converge to 1. Moreover, |'ft),- — (O d \, i= 1,2,3, converge to zero, 
which in turn implies that 'ft),- — > co d . 



5.7 Simulation Results 
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Fig. 5.3 Desired relative attitude convergence and reference angular velocity convergence. 



5.7.2 Adaptive Design 



In this simulation, we show the convergence of the adaptive design with the same 
constant desired angular velocity (O d as in the nonadaptive design. By applying the 
update law in (5.55) and control laws in (5.14), (5.53) and (5.54), we recover the 
convergence results in the nonadaptive design. Fig. 5.4 shows convergence to the 
reference angular velocity and convergence to the desired relative attitude where all 
the agents are aligned. 
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Fig. 5.4 Desired relative attitude convergence and reference angular velocity convergence in the 
adaptive design. 



5.8 Summary 



We studied an attitude alignment problem where the inertial frame information is not 
available to the agents. The decentralized controllers depend on the relative orienta- 
tions and local angular velocities, and guarantee boundedness of the trajectories as 
well as their convergence to the set of equilibria. We further considered the situation 
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where the reference angular velocity information is available only to one agent, and 
developed an adaptive design with which the other agents recover this information. 



5.9 Related Literature 

The attitude coordination problem has been considered in a number of publications. 
For example, reference [66] discussed several coordinated attitude control strategies, 
such as a leader-follower strategy and a virtual desired attitude strategy, and illus- 
trated by simulations the advantage of coordination in the presence of disturbances. 
Reference [141] introduced a decentralized controller to achieve the convergence 
of each agent to the same commanded desired attitude. In [75], the authors intro- 
duced an attitude coordination algorithm applicable to a ring communication graph. 
The results in [75] were extended into a more general graph in [108]. Both [75] 
and [108], however, rely on the inertial frame information. Reference [38] used the 
Modified Rodriguez Parameter to represent the absolute attitudes of rigid bodies. In 
[20], the authors presented a leader/follower strategy of attitude synchronization by 
employing absolute attitudes and angular velocity observers. Unlike the controllers 
in [141, 75, 108, 38, 66], which require the knowledge of the inertial frame, [95] 
presented a design that only depends on the relative orientation among neighboring 
agents. However, the design of [95] requires that the communication topology of the 
agents form a string connection. Reference [115] considered using relative angular 
velocity in attitude synchronization. The work in [58] considered kinematic control 
of attitude synchronization. Since the agent kinematics are relative degree one, the 
attitude synchronization can be achieved with strongly connected graphs. Recent 
research in [31] represented the attitude dynamics in the Lagrangian form and used 
contraction analysis to study coordinated attitude tracking with directed graphs. 



Chapter 6 

The Agreement of Euler-Lagrange Systems 



6.1 Introduction 

In this chapter, we further extend the passivity approach from point robots to agents 
modeled as Euler-Lagrange systems. The Euler-Lagrange model is prevalently used 
to model multi-link robots and encompasses the double integrator model in (2.16) 
as a special case. We first apply the design techniques in Chapter 2 and present a 
nominal design that achieves the agreement of the agents. We next consider the case 
where the dynamics of each agent are subject to a class of parameterized uncertainty. 
Such uncertainty may come from unknown parameters in agent dynamics or exter- 
nal disturbances, such as wind and friction. We develop adaptive control laws that 
compensate for the uncertainty and achieve objectives Al and A2. The closed-loop 
structure considered in this chapter is shown in Fig. 6.1, where the blocks of the 
network information flow and the agent dynamics are the feedforward and feedback 
subsystems in Fig. 2.2, respectively. 

To develop adaptive control laws, we draw on earlier results in single robot tra- 
jectory tracking and present two approaches. The first approach is motivated by 
[138, 143, 68], where a small cross term was added to a Lyapunov function to 
achieve adaptive trajectory tracking for single robot. Applying a similar approach to 
the agreement of multiple Euler-Lagrange systems, we obtain a Lyapunov function 
with a cross term of the storage functions in (2.35). Different from the Lyapunov 
function considered in Section 2.4, this Lyapunov function yields a negative definite 
derivative, which allows us to develop an adaptive design that recovers objectives 
Al and A2 in the presence of uncertainty. The second approach explores the de- 
sign flexibility offered by the passivity framework. We modify the nominal design 
by feeding forward the external feedback u to the velocity error y, which ensures 
that the Lyapunov function in Section 2.4 yields a negative definite derivative. The 
resulting closed-loop system takes a form similar to the controller in [121], where 
a virtual reference trajectory was introduced for adaptive design of single robot 
trajectory tracking. In our problem, the virtual reference velocity for each agent is 
the aggregation of the differences between itself and its neighbors. The results in 
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Fig. 6.1 The passivity structure considered in this chapter. Individual agent dynamics are now 
subject to uncertainty, such as wind disturbance and uncertain parameters in agent model. We will 
develop adaptive control laws that compensate for the uncertainty and achieve objectives Al and 

A2. 



this chapter extend the two widely-used approaches [138] and [121] in single robot 
trajectory tracking to the coordination of multiple robots. 



6.2 The Nominal System 



We consider a group of N agents, each modeled as an Euler-Lagrange system 

M t (xi)Xi + Ci{xi,Xi)xi = ii i = 1, ■ ■ ■ ,N (6.1) 

where x, G R p . The goal is to achieve the agreement of x,'s and objective Al in (2. 1). 
We assume that the reference velocity v(t) in objective Al is bounded and let 



Mv = SU P |v(f)|. 

r>0 



(6.2) 



We will consider the Euler-Lagrange systems that satisfy the following properties 
[70, 105]: 

Property 6.1. M,(x,) is positive definite and there exist two positive constants y mj 
and Ym j such that 

Ymjp < Mi( Xi ) < JmJp- (6.3) 



6.2 The Nominal System 1 1 1 

□ 

Property 6.2. The matrix C,-(x,-,i,) is bounded in x, and linear in xi and thus satisfies 

||Q(^,x,-)||</d|*| (6.4) 

where /„■ > 0. D 

To achieve the control objective, we follow the design steps in Section 2.3. We 
first introduce an internal feedback 

Xi = -Ki{ Xi - v(t)) +Mi( Xi )v(t) + Ci{xi,Xi)v{t) + u u Ki = Kj > (6.5) 

which, together with a change of variable ^, = i, — v{t), renders the following nom- 
inal system 

Xi = yt + v{t) (6.6) 

jp . f M;(jfj)4 = -Ci(pCi,Xi)%i - Kfei + iij 

' ' I yt = &• 

Choosing 

S,(6) - ^ T AW& (6.8) 

as a storage function and using the skew symmetry property in (1.35), we verify that 
Jrffi is strictly passive from m,- to yi. Thus, ^\ is of the form (2.1 1) with the required 
passivity property and satisfies (2.12) and (2.13). 

We next design m,. We define Zk as in (2.2). Following the result in Section 2.6.1, 
we choose m,- as in (2.68) with y/^ defined in (2.72), which yields 

u = -{L A ®I p )x=-{DA®I p )z (6.9) 

where A is defined in (2.73). Recall that for the agreement problem, u — ► is equiv- 
alent to z — > 0, since Property 2.1 is satisfied with % in (2.72). 

Although the internal feedback design in (6.5) and the choice of u in (6.9) follow 
techniques in Section 2.3, the stability results in Theorem 2.1 are not directly ap- 
plicable here. This is because the 3%[ dynamics in (6.7) contain x\(t) signal, thereby 
making M'i a non-autonomous system of <!;,■ and Zk- We next demonstrate uniformly 
globally asymptotic stability of the origin of (z,§) by applying the Nested Matrosov 
Theorem. The convergence to the origin of (z, |) means that the agreement of x,'s 
and reference velocity tracking are achieved. 

Theorem 6.1. Consider the agent dynamics in (6.1) with T; defined in (6.5) and u in 
(6.9). Then the origin of(^,z) is uniformly globally asymptotically stable. □ 

Proof. We take the Lyapunov function 

N t 

Vi = i>(&) + £ft(zit) (6.10) 

;=i k=\ 
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where S,- (<!;,■) and Pk(zk) are given in (6.8) and (2.71), respectively. Using (6.7), (2.6) 
and (6.9), we obtain the derivative of V\ as 

Vi = E (-#*& + 4 r "0 + L fofo (6. 1 1) 

/=1 k=\ 

N 

= -^1 K&-y T {DA®I p )z + z T {A®I p )(D T ®I p )y (6.12) 

1=1 

N 

= -£4 i r ^ 1 :=7 1 <0 (6.13) 

1=1 

which implies the uniform stability of the origin of {£,,£). 

To establish uniform asymptotic stability, we define an auxiliary function 

V 2 = -u T £, (6.14) 

and claim 

Y 1 =0=^V 2 :=Y 2 <0. (6.15) 

To see this claim, we note 

Y 2 = -u T £,-u T t (6.16) 

When V\ = 0, ^ is zero, which implies that the first term in (6.16) vanishes. It follows 
from (6.7) that 4 = MT (.*,•)«,• when t, = 0, which means 

TV 

F 2 | Fl=0 = £ UiMr l { Xi )ui < 0. (6.17) 

Using Property 6.1, we conclude (6.15). We further note that Y\ = Y 2 = implies 
§=0 and m = 0. Recall that for agreement problem Property 2. 1 is satisfied and 
thus u — is equivalent to z — (see (2.34)). Then all assumptions of the Nested 
Matrosov Theorem in Theorem B.5 are satisfied and we conclude uniformly glob- 
ally asymptotic stability of the origin of (| , z) . □ 



6.3 The Uncertain System 

Suppose that the dynamics of each agent are now subject to a class of uncertainty 
di(t,Xi,Xi), parameterized by 

di(t,Xi,xi) =Yi(t,Xi,Xi)8i, (6.18) 

where 6, € W is an unknown constant vector and Yi(?,x;,i,-) € M. pxr is a regressor 
matrix available to agent i. Then the agent's model becomes 

Mi(xi)xi + Q(xi,Xi)xi = Xi + Yi{t,xi,xi)di i= !,-•• ,N. (6.19) 
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We make the following assumption on the boundedness of Yj(t,Xi,Xi): 

Assumption 3 Ifxi is bounded, Yi(t,Xi,X{) is bounded for all t > andxi £ R p . □ 

The parameterization (6.18) encompasses a broad class of uncertainties. For ex- 
ample, when Yj(t,Xj,Xj) = I p , (6.18) models constant disturbances. From standard 
result in the robotics literature [143, 105], (6.18) can represent the effects due to un- 
known constant parameters in M,(jc,) and C,(x,,i,) in (6.5). To see this, we recall that 
both M{(xi) and C,(x,-,^,) are linear in these constant parameters [70, 105]. Suppose 
that M,(xi) and C,(xj-,i,) are estimated by M,(jc,) and C,(x;,i,) whose parameters are 
the estimates of the unknown constant parameters in M,(x,) and C,(x,-,i,). Instead of 
(6.5), agent ; implements 

Xi = -Ki{xi-v(t))+tii(xi)v{t)+Ci{xi,Xi)v{t)+Ui (6.20) 

= -Ki{Xi - V(0) +M i (x i )v(t) +Ci(Xi,Xi)v{t) + U; 

+ {Mi{xi)-Mi{xi))v{t) + (Ci(xi,Xi)-Q(xi,Xi))v(t). (6.21) 

Since M,(x,) and C,(i,-,x,) are linear in robot parameters, the last two terms in (6.21) 
satisfy 

(tii(xi) -Mi( Xi ))v(t) + (di(x h Xi)-Q(x h Xi))v(t) = Yi(v(t),v(t),Xi,Xi)0i (6-22) 

for some Yi(v(t), v(t),Xj,Xi) and 0{. The closed-loop system of (6.1) and (6.21) then 
takes the form (6.19). 

The next example uses the double integrator model (2.16) as a special case and 
illustrates the effects of the uncertainty di(t,Xi,X{). 

Example 6.1. A special case of the Euler Lagrange system is given by the double 
integrator model (2.16), where M,(x,) = m,7 p , C,(x;,i,) = p . In this case, (6.5) re- 
duces to (2.17), which, together with (2.16), leads to 

mjXi = —ki(xi — v(t )) + m(v(t) + u%. (6.23) 

Suppose that agent i has only the nominal value of m,, denoted by m,, for imple- 
mentation and is subject to a constant disturbance 6f £ W and viscous damping of 
the form Qfxu where df £ R is a constant. Then, the closed-loop dynamics of agent 
i become 





niiXt = -ki(xi - v(t)) +rhjv(t) + m,- + Q t + Ofxi 
= -ki(xi - v(t )) +mjv(t ) + u t + Yi(t,Xi,Xi)Qi 


(6.24) 
(6.25) 




dj(t,Xi,Xi) 




where 


Yj(t,Xi,Xi) = [I p xi v(t)] 


(6.26) 


and 


e i = [(d!) T ef f] r = [(0/) r ef m,~m,} T . 


(6.27) 
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We see that <i,(f,x,,i,) encompasses the uncertainty in the model parameter, un- 
known viscous damping and constant disturbance. 

We now perform a numerical simulation to see the effects due to a?,-(?,x;,i,). 
Consider a group of four agents with p = 1 . We choose m, = 1 and kj — 2, Vi. The 
nominal values for m, are set to rh\ = 0.9, mi — 0.8, mo, — 1.1 and m^ = 1.2. The 
viscous damping coefficient 6f is set to —0.05 while the constant disturbance dj 
is chosen as 0.5, V/. We choose v(t) — sin(f). The initial values for x, and i, are all 
zero. The incidence matrix D is given by 



(o 



D 



-l\ 
1 
-1 1 
\0 -1 1 / 



(6.28) 



which means that only agents 2 and 3, agents 3 and 4 and agents 4 and 1 are neigh- 
bors. The weight matrix A is set to ^3. 

The simulation result in Fig. 6.2 shows that the agreement objective fails due to 
the presence of di{t,Xi,Xi). Similarly, the tracking of the reference velocity is also 
not achieved, as illustrated in Fig. 6.3. □ 



0.15 




-0.05 - 



-0.15 



20 30 

time (sec) 



Fig. 6.2 In Example 6.1, the agreement of x,'s is not achieved because of the uncertainty 
di(t,Xj,ii)'s. 



As shown in the above example, the uncertain system (6.19) with the nominal 
feedback law (6.5) cannot achieve the agreement of x,'s and objective Al. To com- 
pensate for the uncertainty dj(t ,x,,i,), we denote by 0, the estimate of 0, and modify 
the control law (6.5) to 



6.4 A Preliminary Adaptive Design 
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Fig. 6.3 In Example 6.1, x,'s do not track v(t) because of the uncertainty i/,(f,x,,x,)'s. For clarity, 
we only show X\ — v(f ) in this figure. 



Xi = -Ki{xi - v(r)) +Mi(xi)v(t) + Ci(xi,Xi)v(t) + uj - Y t (t,xuXi)di. (6.29) 

The goal in the next sections is to design the update law for 0, such that the control 
law in (6.29) guarantees the agreement of jcj's and objective Al. 



6.4 A Preliminary Adaptive Design 



We now present a preliminary adaptive design based on the storage functions 5 , ,(<^,) 
and^ fc (z (fe )in(2.35).Let 

6i = 6j - t (6.30) 

and 

8 = [8?, — ,8£] T - (6.31) 

Note that (6.19) and (6.29) give rise to 

Xi=yt + v{t), 



yp. (M t (xi)qi = -Ci(xi,Xi)%i- K& + uj -Yj(t ,x ; ,i,)r5,- 



(6.32) 
(6.33) 



Consider the Lyapunov function 
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V = £ S t (£) + £ P, (zt) + 1 £ 0, r A,- ' 0, A,- = Af > (6.34) 

;=i k=\ L 1=1 

whose time derivative is given by 

TV A? 

V = - £ ^Kfa - £ 4 r ^(r,^,i,)0,- + 9fAr l e,. (6.35) 

1=1 i=i 

If we design the update law for 0, as 

& t =AiYt(t,xt,itfZt Ai=Aj>Q (6.36) 

we obtain from (6.30) and (6.35) that 

N 

V = -£ff K &<0 (6.37) 

1=1 

which proves stability of the origin of (| ,z, 0). 

Although the adaptation law (6.36) guarantees closed-loop stability, it does not 
ensure global convergence of u — > 0, i.e., the agreement of jc,-'s. To see this, let us 
look at the case where Yi(?,x,,i,) = I p . We note that the equilibrium of (6.33) and 
(6.36) is given by <!;, = and m, = 0,, which means that equilibria where u ^ exist. 
Thus, u — > cannot be achieved globally. The next numerical example confirms this. 

Example 6.2. We consider Example 6.1 with only constant disturbance 0,, which 
means in (6.25) Y/(f,x,,i,) = I p . According to (6.29) and (6.36), each agent imple- 
ments 

Xi = -K { {xi - v{t))+m{v{t) + iii - 6j (6.38) 

where 0, is update by 

( =A,(xi-v(r)). (6.39) 

We take A,- = 1 and set the initial conditions of 0,'s to 0i(O) = 0.1, 02(0) = 1.1, 
03(0) = 0.3 and 04(0) = 0.7. Fig. 6.4 shows that agreement is not achieved. □ 

One way to achieve u — > globally is to ensure that the time derivative of the 
Lyapunov function contains a negative definite term of u. We next present two ways 
to achieve this: In the first approach, we employ the "adding cross term" technique 
[138, 144] and find another Lyapunov function for the nominal design (6.5) that 
yields a negative definite derivative in terms of £, and u. An adaptive design based 
on this Lyapunov function guarantees u — > and thus z — > 0; In the second approach 
we explore the design flexibility of the passivity framework and modify the nom- 
inal design (6.5) by feeding forward m, to the velocity error xi — v(t). As we will 
illustrate, this feedthrough term results in a closed-loop system similar to the cele- 
brated Slotine-Li type system [121] for single robot tracking control. This approach 
yields better performance in convergence speed than the first approach but requires 
it information available for implementation. 
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Fig. 6.4 In the presence of uncertainty, if we initialize fli(0) = 0.1, 02(0) = 1.1, (%(0) = 0.3 and 
04(0) = 0.7, the agreement of x,'s is not achieved by (6.29) and (6.36). 



6.5 Design 1: Adding a Cross Term 

To state our result, we introduce the notation 
M = diag{Mi(*i), • ■ ■ ,M N (x N )} C = diag{Ci(xi,*i),-- -,C N (x N ,x N )}, (6.40) 



F c = diag{/ C i, ---j/cv} K = dmg{K u ---,K N }, 
Y = diag{Fi {t,x\ ,x\), ■ ■ ■ ,Y N (t ,x N ,x N )} 
and note from (6.33) that 

Af| = -C%-K%+u-Y6 
y = %—x- ltf<S>v(t). 

We now introduce the storage function 

V c = -fi T M.% + -z T (A(g>I p )z-e$ T MQ(u)u, e > 



Q(u) = diag 



(6.41) 

(6.42) 

(6.43) 
(6.44) 



(6.45) 



(6.46) 



where 

1 1 

P2| M 7F /p ' " ' ' l+2\u N \ 2 

The cross term e^ T MQ(u)u is added to create a negative definite term of u in V c 
for sufficiently small e. Proposition 6.1 below shows that this storage function guar- 
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antees passivity with respect to a modified output, namely x — l^ (g> v(t) — eQ(u)u 
instead of ^ = x — Iat <S> v(f). Theorem 6.2 then replaces E, in the adaptation law 
(6.36) with this modified output. 

Proposition 6.1. Let Q(u) be defined as in (6.46) and let 



e < mm 



2A ma x(Af)A ma x(izl) + ^7jA ma x(^e) + 4 (Mv^max(^c) + ^mnx(K)) 



^/X min (M)?i m m(A) \ 

A max (M)A max (Z)Zi) y ' 

77ien f/ie dynamics in (6.43)-(6.44) with u defined in (6.9) are strictly passive from 
—Y6tox—lN®v(t) — eQ(u)u. □ 

Proof. We first show that V c in (6.45) is indeed a positive definite function of z and 
|. We note from the definition of Q(w) in (6.46) that 

Ilfi(»)ll<l. (6-48) 

which allows us to bound the third term in V c as 

\et, T MQ(u)u\ = e\^ T MQ(u)(DA®I p )z\ 

< e\\M\\-\\DA\\-\$\-\z\. (6.49) 



Because 



and 



1^M4>^^| 2 (6 . 50) 

^®/,) z >^)|*f, (6 . 51) 

we conclude that V c is positive definite if 

0<£< V^WU ^l (652) 

K s ^.{M)k mia {DA) 

which is indeed the case from (6.47). 

The time derivative of V c along the trajectories of (6.43)-(6.44) is given by 

% = -^ T K^-^ T Ye-e{M^+Mt,) T Q(u)u-et, T MQ(u)u-et t T MQ(u)u. 

(6.53) 
Using (6.33) and the skew symmetry property (1.35), we further expand the third 
term in V c as 

e(Af| +M^) t Q(u)u = £ [(C r 4 -K£,) T Q(u)u + u T Q{u)u- (Y9) t Q(u)u\ . 

(6.54) 
We now consider the last two terms in (6.53). Note from (6.46) that 
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Q(u)u = diag\ - ""'" 

I (l+2\ui\ z y (l+2|w i v| / ) z 



All\U T y 



Au N u T N 



Since 



we obtain 



4uiuf 



(l+2k| 2 ) 2 



41 M; 



- A<i 



(l+2| M ,f) 2 - 1+4|h 
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(6.55) 

(6.56) 
(6.57) 

(6.58) 



|6(«)»| < |i*|. 
Using (6.9), (6.32) and (6.33), we get 

u = -(L 4 (8)/ p )(| + WO v(f)) = -(L A ®I p )t,. 

From (6.57) and (6.58), we bound the last two terms in (6.53) as 

\e^ T MQ{u)u\ < eA max (M)A max (L 4 )|^| 2 (6.59) 

\e$ T MQ(u)u\ < eA max (M)A max (L 4 )|^| 2 . (6.60) 

In particular, the last inequality (6.60) follows due to (6.48). 
We now rewrite (6.53) from (6.48), (6.54), (6.59) and (6.60) as 

% < -{K^{K)-2eK^{M)X mm {L A )M\ 2 -eu T Q{u)u+^-eQ{u)u){-Yd) 
- e{C T B, - K^f Q{u)u (6.61) 

< -(Km(K) - 2eA max (M)A max (L 4 ))|£ | 2 - e\Q(u)u\ 2 + (§ - eQ(u)u)(-Yd) 
+ eA max ( J Rr)|^|| M | + ei|C|l- 1§| • |g(u)«| (6.62) 

where we used 

u T Q(u)u > u T Q(u) T Q(u)u (6.63) 

which follows from (6.46). Property 6.2, together with (6.2) and (6.32), implies that 
||C|| < K^(FcM < Wc)|§| +A max (F c ) J U v (6.64) 

which allows us to bound the last term in (6.62) as 



N 



e\\C\\ • |C| • \Q\i,)u\ < ^eK^FcMf + eXm^iF^^M 



where we also used (6.48) and 



\Q{u)u\ < 



N 



2s/l 



To see (6.66), we note from (6.46) that 

\Q(u)u\ = 



*«">'«">- iffici^U*^ 



(6.65) 



(6.66) 



(6.67) 
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since 

(l+2kl 2 ) 2 -8- (6 ' 68) 

Then, V c in (6.62) becomes 

^=-(lflW^) r (ftO(ieM-l) +(§ - efiW,l)r( - yS) (6 ' 69) 



where 



and 



Because 



a = KUK) - 2eA ma x(M)A max (L 4 ) - — ^eA max (F c ) (6.70) 

& = -~e(A ni „(^K + W*))- (6.71) 



e < ^ m(jg) ; , (6.72) 

2A max (M)A max (L 4 ) + ^-7^hmx{F c ) + ^ (^v^max(^c) + ^max(^0) 

a > andag > £> 2 , we conclude that P 6 is positive definite. Noting § =i— 1jv<8>v(?) 
in (6.44), we then obtain from (6.69) the strict passivity from —70 to x— ljy ® v(f) — 
eQ(u)u. D 

It is possible to choose the normalization matrix Q(m) in (6.46) differently. 

For example, following [68], we can choose Q(k) = diag < 1+ | J p , ■■■ , i } , I p >, 

which is bounded and ensures the boundedness of Q(u)u. If we were to set Q(u) to 

identity matrix, we could not obtain the first bound in the right hand side of (6.65). 

With the strict passivity property in Proposition 6.1, Theorem 6.2 below proves 

t, — > and u — ► if agent i implements the feedback law (6.29) with 0, updated by 

§ i =A i Y i (t,Xi,x i f(x i -v(t)-e - U \ ), Ai=A[>0. (6.73) 

l+2\iij\ z 

Theorem 6.2. Consider a group ofN agents modeled as (6.19). Suppose that each 
agent implements (6.29) where M; is as in (6.9) and 9i is updated by (6.73) with a 
positive e satisfying (6.47). Then, the origin of(%,Z, 6) is stable and the trajectories 
of(^,z,6) are bounded. Moreover, \xj — v(t) | — > 0, V/ and \xt —Xj\ — > 0, V7, j. D 

Proof. The closed-loop system of (6.19), (6.29) and (6.73) is given by (6.43), (6.44) 
and 



6, = d i =A i Y i (t,x i ,x i ) T ($ i -e 1 ■ '|.,| 2 )- (6.74) 



it 
I +2\u, 
We take 



1 N 

Vi=V c + -Y^e[Ar l 0, (6.75) 

2 i=i 
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as a Lyapunov function, where V c is in (6.45). Using Proposition 6.1 and (6.74), we 
obtain 

* = -(iJ(ti„i)HeM»i) so ' ( " 6) 

which implies stability of the origin of (£, ,z, 0) and the boundedness of the signals 
(£,z, 0). We further note from Property 6.1 and 6.2, Assumption 3 and (6.33) that £, 
is bounded. Since E, is bounded, u in (6.58) is bounded and thus Q(u)ii is bounded. 
Using (6.57), we conclude that V\ is bounded. We then apply Theorem B.5 and 
obtain \ -> and Q(u)u -»• 0. It follows from (6.32) and (6.9) that |i, - v(t)\ -> 
and \xt —Xj\ — > 0, V;, /'. D 

Since the parameter £ is implemented in the update law (6.73), it must be chosen 
sufficiently small. To remove the dependence of £ on L4 , we can bound | \L& 1 1 by 
/V||4||. Given M, K, F c and N, there is a permissible range of £, which in turn 
determines the convergence rate. 

Because % and u both converge to zero, we note from (6.33) and (6.73) that 
Yi(t ,Xi,xi)6i — > and 0,- — > 0. If, in addition, Yt(t,Xi,xi) satisfies the PE condition in 
(4.23), we further conclude from Lemma 4.1 that 0, — » 0, i.e., 0, — > 0,-. We illustrate 
this parameter convergence in the following example. 

Example 6.3 (Example 6.1 continued). 

We apply (6.29) and (6.73) to Example 6.1 and obtain the closed-loop system for 
agent i as 

mix) = —kj(xi — v(t))+mjv(t) + Uj + Yi(t,Xi,Xi)(di — dj) (6.77) 

where v(t) = sin(?), Yj(t,xt,Xi) is in (6.26) and 0, is in (6.27). 

In the simulation, we first take £ = 0.15 and randomly choose the initial con- 
ditions for 0,. The update law for 0,- is in (6.73) with A, set to 2. All other initial 
conditions are the same as in Example 6.1. The simulation results in Figs. 6.5-6.6 
show that the differences between x,'s and the tracking error xi — v(t) converge to 
zero. With the same initial conditions, we next choose £ = 0.05. The agreement of 
xt's is shown in Fig. 6.7, where the convergence is slower than the case of £ = 0. 15. 

The estimates of for 9- in (6.27), j — 1,2,3, are shown in Figs. 6.8-6.10. We 
note that all 0/'s converge to their true values. This is because the regressor matrix 
Yf(t,Xi,Xi) becomes PE as |i, — sin(f)| — >0. To see this, note that Yi(t,xi,Xj) in (6.26) 
converges to [1 sin(?) cos(f)] as |i, — sin(f)| — ► 0. The regressor [1 sin(f) cos(t)] T 
is PE since Vt > 

ta+S 

[1 sin(s) cos(s)] 1 [1 sin(s) cos(s)}ds > al 3 (6.78) 

is satisfied with 5 — 2n and a = %. Let % = Yi(t,Xi,X{) — [1 sin(f) cos(f)]. Since 
Yi(t,Xi,Xi) is a continuous and bounded function of t, we drop the dependence of Y,- 
and Y on xi and i, and define 
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K = max\Yt(t)\. (6.79) 

I>0 



It then follows from (6.78) that 

rto+S 



f (Yttf-tiWfQriW-fiWds (6.80) 

f" Yi(s) T Yi(s) + Yi{s) T Yf{s) - Yi{s) T Yi(s) - Yi(s) T Yi{s)ds > a/ 3 .(6.81) 

J to 



Because |f;| — > 0, there exists a finite time t\ such that \/t >h, \Y\ <c for some 
positive scalar c. Thus, evaluating (6.81) at any t' > t\ leads to 

rt'+S 
ah < J i YiisfYiW+YiisfYiW-YiisfYiisJ-YiisfYi^ds (6.82) 

rt'+S rt'+S 

< I Yi(s) T Yi(s)ds + c 2 8h+2 I c\Yi(s)\dsI 3 

rt'+S 

< I r / (*) r ^(*)^+(2cJc5 + c 2 5)/ 3 . (6.83) 
Therefore, we have 

[ X Yi(s) T Yj(s)ds>(a-2cKS-c 2 S)h. (6.84) 

Then for sufficiently small c, i.e., sufficiently large t\, a — 2ck8 — c 2 S is positive 
and thus Y(t) is PE starting from time t\. 



6.6 Design 2: Feedforward of the External Feedback 

We note from Design 1 that the strict passivity in Proposition 6.1 is important in 
compensating for the uncertainty dj(t,Xi,Xj). In this section, we present another de- 
sign that guarantees this strict passivity by feeding forward m to y, in (6.33). In 
particular, we replace y, = %i in (6.33) with 

yi = Zi+itm (6.85) 

where J7 = r? > 0. This modification is achieved with a new feedback law 

TJ = Mj(xi)rjUi + Q(xi,Xi)riUi+Mi(xi)v(t) + C(xi,Xi)v{t) - Ki{xi - v{t) - J]m ; ) 
+ u i -Y i {t,Xi,Xi)d i . (6.86) 

Note that (6.86) employs u, which implies from (6.9) that relative velocity informa- 
tion must be available. 
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Fig. 6.5 Using the design (6.29) and (6.73) in Example 6.1, we achieve the agreement of x,'s. 
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Fig. 6.6 The tracking errors x, — v(t) converges to zero with the design (6.29) and (6.73). 
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Fig. 6.7 The convergence rate is slower than that in Fig. 6.5 since we decrease e from 0. 15 to 0.05 
in Example 6.3. 
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Fig. 6.8 The convergence of the estimates fy , i = 1, • • • , 4. 
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Fig. 6.9 The convergence of the estimates Of, i — 1, • • • , 4. 
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Fig. 6.10 The convergence of the estimates 9f , i =!,••• ,4, 
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We rewrite (6.32), (6.33) and (6.85) in a compact form 

x = % +Tu + In ® v(?) (6.87) 

Mt, = -C%-K%+u-Y9 (6.88) 

where F = diag{F,- • • ,F/}- Proposition 6.2 below proves strict passivity from 
—Y6 to x — lyv <8> v(f ) — Fm with the storage function 

V s = l -t; T Mt;+ l -z T {A®I p )z. (6.89) 

Proposition 6.2. The dynamics in (6.87) and (6.88) are strictly passive from —Y9 

tO X— l^rlg) v(f) — Fm. □ 

Proof. Consider the storage function (6.89). Its time derivative along the trajectories 
of (6.87) and (6.88) is given by 

V s = -Z T K%-u T ru-£ T Ye. (6.90) 

Noting | = x— In ® v(?) — Tu in (6.87), we obtain the required strict passivity 
result. □ 

We then propose the update law for 0, as 

4 = AiYiit^uxtfixi - v(t)-r t m) (6.91) 

and prove the same stability and convergence results as in Theorem 6.2. 

Theorem 6.3. Consider a group of N agents modeled as (6.19). If each agent im- 
plements (6.86) where M; is as in (6.9) and 9[ is updated by (6.91), then the ori- 
gin of (£,z,9) is stable and the trajectories of (E,,z,9) are bounded. Moreover, 
\xi — v(t) | — ► and \xt —Xj\ — > 0, Vi, j. D 

Proof. Using V = V s + jY^Li 9f A^ l 9t as a Lyapunov function, we obtain from 
(6.90), (6.91) and (6.30) that 

N 

V = - £ (#£& + ujllui) < 0. (6.92) 

The negative semidefinite derivative in (6.92) proves stability of the origin of 
(£,z, 9) and the boundedness of the signals (£,,z, 9). The rest of the proof mim- 
ics the proof for Theorem 6.2. □ 

Theorem 6.3 relies on the —ufTiUj term in (6.92), which is achieved by feeding 
forward Uj to the velocity error y,-. The next example shows that with this feedfor- 
ward term, the closed-loop system of (6. 19) and (6.86) exhibits a form similar to the 
Slotine-Li type of controller in [121]. To illustrate this similarity in its basic form, 
we assume that 0, = in this example. 
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Example 6.4. The Slotine-Li type controller [121] gives rise to a closed-loop system 
of the form 

Mi{xi)xi + Ci(xi,Xi)xi = Mi(xi)ri(xf(t) -xi) + C i (x i ,x i )r i (xf (t) -xi) + M,-(x,-)xf(f) 
+ Cixi^xf^-Kilxi-xf^+mxi-xiit))], (6.93) 

which achieves tracking of \xj — xf(t)\ — > and \xi —xf(t)\ — ► 0. The closed-loop 
system of (6. 19) and (6.86) is given by 

M/(x;)i 1 I - + C;(i;,X;)i; = Afi(Xi)Ii&i + Ci(xi,Xi)riUi+M{(Xi)v(t) 

+ C{xuXi)v{t)-Ki[xi-v{t)-Tiui]+Ui. (6.94) 

Comparing (6.93) and (6.94), we note that w, and i, — v(t) in (6.94) are similar to 
Xj —xf(t) and xi—xfif) in (6.93), respectively. In fact, in the Slotine-Li controller, 
the tracking objective is represented in the absolute coordinate xi while in our case, 
each agent's tracking objective is w, — > 0, which is represented in the coordinate of 
the relative variables z\- The last term m,- in (6.94) is used for interconnecting with 
the z dynamics. □ 

We now demonstrate that tuning the free parameter 77 can yield better perfor- 
mance in compensating for the uncertainty than the approach in Section 6.5. Note 
that this is at the cost of employing relative velocity information in (6.86). 

Example 6.5 (Example 6.1 continued). 

We now use Design 2 to compensate for the uncertainty in Example 6.1. Apply- 
ing (6.86) to the double integrator model (2.16) and noting that only the nominal 
values for m; are known, we obtain 

%i =miI{Ui+miv{t)—ki(xi — v{t)—riUi) + Ui + BJ + 0, 2 i,- Y,(?,x,-,x,)0,- (6.95) 

where the form of Y((t ,xi,xi) is to be specified. We further rewrite (6.95) as 

miXi=miriUi+mv(t)-ki(xi-v(t)-riUi)+Ui+Yi(t,Xi,Xi)8i-Yj(t,Xi,Xi)§i (6.96) 

where 6), is as in (6.27) and Yi{t,xi,x) is given by 

Yi(t,Xi,Xi) = [l xi v{t)+I\ui]. (6.97) 

The update law for 0, is given by (6.91). 

In the simulation, we choose 1] = 0.15 in (6.95), the same as e in Example 6.3. 
All the other parameters are the same as in Example 6.3. The simulation result in 
Fig. 6.11 shows that the agreement of jc,'s is achieved while Fig. 6.12 illustrates the 
tracking error i, — v(t) converges to zero, V7. Note that the convergence in Figs. 6.11 
and 6. 12 is similar to that in Figs. 6.5 and 6.6. 

Unlike e in Example 6.3, i7 is a free parameter. We then increase I] to improve the 
convergence. Fig. 6.13 shows the agreement result for I] = 1 with the same initial 
conditions as the case of J] = 0.15. The convergence is faster and the oscillations 
are smaller than those in Fig. 6.11, where H = 0.15. □ 
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Fig. 6.11 Using the design (6.86) and (6.91) in Example 6.1, we achieve the agreement of jc,'s. 
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Fig. 6.12 The tracking error i, — v(f) converges to zero with the design (6.86) and (6.91). 
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Fig. 6.13 The oscillation is smaller than that in Fig. 6.1 1 and the convergence is faster because we 
have increased A, from 0. 15 to 1 in Example 6.5. 



6.7 Summary 



In this chapter, we studied the agreement of multiple Euler Lagrange systems with 
parameterized uncertainty. We presented two approaches to compensating for the 
uncertainty. The first approach relies on the inherent robustness of Euler Lagrange 
systems and employs the "adding cross term" technique. The second approach feeds 
forward the external feedback and takes a form similar to the Slotine-Li type con- 
troller. The second approach achieves a faster convergence than the first approach. 
However, relative velocity information must be available to implement the second 
approach. 



Chapter 7 

Synchronized Path Following 



7.1 Introduction 

So far, we have focused on the formation control problem — maintaining the desired 
formation towards a common goal. For reconnaissance missions or maneuvering 
around obstacles, additional requirement of following a specified path may be re- 
quired. The primary goal in path following problems is to design control laws that 
force the output of a system to follow a desired path. The secondary goal for the 
system is to obey a speed assignment along the path. In particular, we will consider 
the formulation where we parameterize the desired path by a path variable 0, and 
assign a desired speed to be achieved by 6 as t — > °°. Unlike the classical track- 
ing problem, where the speed assignment must be fulfilled for all t > 0, the path 
following problem offers flexibility to shape the transient behavior of . 

In this chapter, we exploit this flexibility to synchronize the path variables for 
a group of path-following systems. When the path variables reach agreement, the 
group exhibits a desired formation. Note that this synchronized path following ap- 
proach achieves the desired formation through coordinated tracking of individual 
trajectories. This is different from the formulations in position/distance-based for- 
mation control studied in Sections 2.6 and 2.7, where the desired formation is 
achieved by controlling relative configurations. To synchronize the path variables 9, 
we apply the agreement schemes in Section 2.5 and obtain two synchronized path 
following designs. In the first design, the agreement schemes in Section 2.5 and 
the path following systems constitute a cascaded system. Because the agreement 
schemes are globally asymptotically stable, we demonstrate the closed-loop stabil- 
ity of this cascaded system by showing that the path following systems are Input-to- 
State Stable (ISS). In the second design, we will represent the closed-loop system 
as the feedback interconnection of a dynamic block for path variable agreement 
and another block that incorporates the path following systems. Since the agree- 
ment block is passive, we prove closed-loop stability by showing the path following 
systems are also passive, thereby satisfying Structure 3 in Section 1.5. 
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7.2 Path-following Design and Synchronization 

Consider a general system 

x = f(x,u) 

y = h{x) { ' A) 

where x E K" is the state vector, y € R m is the system output and u € M" is the 
control. The objective of the path-following problem is to force y to a prescribed 
feasible path y c i{Q) and to assign a feasible speed v(t) to 9 on this path, where 9 is 
a scalar. The parameterized path yd{9) is a geometric curve defined as 

Y d := {y G K'"| 30 e R such that y = y rf (0)} (7.2) 

where y f / is continuously parameterized by the path variable 9. 

We assume that there exists a preliminary design that brings (7.1) to a closed-loop 
system of the form 

z = F(x)z-g{t,x,9)co 

9 = v(t)-co U ^> 

where z is a set of new parameters that include the tracking error y~y c i{9) and its 
derivatives, and ft) is a feedback term to be designed such that the reference velocity 
v(t) is achieved asymptotically, i.e., 

co — > as t — ► k>. (7.4) 

The matrix F(x) € R" x " and the vector g(t,x, 9) € W depend on the control design. 
In particular, F(x) satisfies 

PF(x)+F T (x)P < I (7.5) 

for some matrix P = P T > 0. In Section 7.4, we will demonstrate how to transform 
dynamics of marine vehicles to the form in (7.3) using backstepping. Preliminary 
design examples can also be found in [120]. 

Assumption 4 For the system in (7.1), a feasible path y^Q) and uniformly bounded 
speed assignments v(t) imply that the function g(t,X, 9) is uniformly bounded in its 
arguments. □ 

We now consider a group of agents i= 1 , • • • ,7V, each controlled by an individual 
path-following design with a prescribed velocity v(t), leading to the closed-loop 

system 

y { ii = Fi(xi)zi - gi{t,xt, 9i)cOj 

L, -\9 i = v(t)-co i . {Lb} 

The goal of synchronized path-following is to design ft), such that the path vari- 
ables 9j, i — 1, • • • ,N, reach agreement with (7.4) satisfied. The design of ft), de- 
pends on variables of agent i and the path parameters from its neighboring agents. 
Therefore, only one scalar variable needs to be transmitted from each agent to its 
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neighbors. For convenience, we introduce the concatenated vectors 



n 



,e N ] T a = [(o 1 ,---,a) N ] T (7.7) 



z=[z\,---,zn] 8 = diag{g u ---,g N }. (7.8) 



7.3 Passivity-based Designs for Synchronization 
7.3.1 Design 1: Without Path Error Feedback 

A straightforward idea for synchronizing 0,'s in (7.6) is to apply the result from 
Corollary 2.1. To this end, we design the input ft),- as 

(% = ^iW 1 d ik y k {6 k )\, i=l,... t N (7.9) 

where J^t{uj} satisfies (2.1 1)-(2.15) and G k is defined as 

^=i^={XXi k kt^- (7 - 10) 

The definition of dk is similar to z k defined in (2.2). Therefore, similarly to (2.6), we 
have 

0=D T e (7.11) 

where = [6>i,-" ,0i] T . The nonlinearity yf k (-) satisfies (2.27)-(2.31) and (2.46). 
Following Corollary 2.1, we conclude that (7.9) ensures global asymptotic stability 
of the origin of (£ , 6), i.e., the agreement of 0,'s and (7.4) are achieved. 

Does (7.9) also guarantee zt — > 0, i.e., does each agent track its prescribed path? 
To answer this question, we note that the closed-loop system (7.6) and (7.9) be- 
comes a cascaded system in Fig. 7.1. In Theorem 7.1 below we prove that Z; in 
(7.6) is Input-to-State Stable (ISS) with respect to ft),. Stability of the closed-loop 
system then follows because a cascade of an ISS and a uniformly globally asymp- 
totically stable system is uniformly globally asymptotically stable (see Theorem B.7 
in Appendix B.6). Thus, each agent follows its desired path, that is, z, — > 0. 

Theorem 7.1. Consider the dynamics of the ith agent in (7.6), i — 1, • • • ,N. Let the 
control ft); in (7.6) be as (7.9), where -J%i{-} denotes the output at time t of a static 
or dynamic block satisfying (2.11)-(2.15). Then, (7.6) is ISS with respect to the input 
ft); and the origin of(^,z,B) is uniformly globally asymptotically stable. □ 

Proof. We only need to demonstrate the ISS property of Z, in (7.6). To do so, we 
define a storage function 
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Fig. 7.1 Block diagram for the synchronized path following Design 1. 



vi = £zffl* 



where /^- is as in (7.5). Its time derivative along (7.6) is given by 

N 

V z <-^zjzi + 11^0)1 

1=1 

N 

i=i 



(7.12) 

(7.13) 
(7.14) 



where p,, 
we get 



\Pj 1 1 and 5 ? , is an upper bound on g, due to Assumption 4. Furthermore, 



M > 



2p im 8 gi \(Oi\ 



N 

(=1 



■e z< 



(7.15) 



where < £ < 1. Thus, it follows from Theorem B.8 in Appendix B.7 that the 



system is ISS from a>; to zi with p (r) 



Since the origin of (§, 6) is globally 



asymptotically stable by the design in Corollary 2. 1 and Z,- is ISS with respect to 
ft),, it follows from Theorem B.7 that the origin of (9,%,z) is uniformly globally 
asymptotically stable. □ 

Remark 7.1. The cascaded structure of Design 1 in Fig. 7.1 also allows us to apply 
the adaptive designs in Chapters 3 and 4 when only one agent has the v(t) informa- 
tion. Since the desired target set for 6 is the origin, both adaptive designs in Corol- 
lary 3.1 and Theorem 4.2 ensure ft) — > and the synchronization of 0,'s. Because 
the adaptive design cascaded to the Z, blocks is globally uniformly asymptotically 
stable and because the Z, blocks are proven to be ISS, it follows from Theorem B.7 
again that the tracking error z converges to zero and the synchronized path following 
is achieved. □ 



7.3.2 Design 2: With Path Error Feedback 



When the path error z, is available for feedback design, we choose 
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(Oi=<m{ui} (7.16) 

where 

l 

a, = 2zjp igl + £ d ik¥k (e k ). (7.17) 

7=1 

The first term in (7.17) is the path error feedback that serves to improve convergence 
to the desired path while the second term is for the agreement of 0,'s. 
If 3% is static, we restrict it to be of the form 

(Oi = hi{ui) (7.18) 

where the function /!,(•) is strictly passive as defined in Definition 1.3. 
If ,yt?i is dynamic, we assume that it is of the form 

& = /*(£)+«<(&)$ ( 7 - 19 ) 

(O i = h l (^)+b i (^ i )u l (7.20) 

where /;(•)> "/(•)> ^(')» anc ' ^'(') are continuous functions such that /}(0) = and 
/i,(0) =0. The main restriction of (7. 19)-(7. 20) is that it be passive with a C 2 positive 
definite and radially unbounded storage function 5i(§») satisfying 

s, < -Wife) + um - Ci u 2 , a > o (7.2i) 

for some positive definite function W/(|,-). Inequality (7.21) with c,- > implies strict 
input passivity, which can be achieved only when (7.19)-(7.20) has a relative degree 
zero. Our asymptotic stability result allows c, = if (7.19)-(7.20) has well-defined 
relative degree one at <f ; = 0, that is 

M6) = 0, 0,(0)^0, ^^16=0^0. (7.22) 

We thus make the following assumption: 

Assumption 5 Ifcj — in (7.21), then (7.22) holds. D 

With ft), in (7.16), the closed-loop system is shown in Fig. 7.2, where 

i 
Ui = -Y J d ikVk{Qk), u=[ui,-~,m] T (7.23) 

and 

Yi = 2ziPigi, Y = [Y h ---,Y N ] T - (7-24) 

We investigate stability properties of the closed-loop system by separating it into 
two blocks, S^\ and 5^2 as in Fig. 7.2, and analyze passivity properties of each block. 
The following theorem shows the passivity from — ft) to Y and thus the passivity of 
S2 from u to ft). Because S?\ is also passive (see Fig. 2.2 and Theorem 2.1), we 
conclude closed-loop stability from Structure 3 in Section 1.5. We further prove 
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Fig. 7.2 Block diagram for synchronized path following Design 2. 



the uniformly globally asymptotic stability of the origin of (0,1;, z) by applying the 
Nested Matrosov Theorem. 

Theorem 7.2. Consider the dynamics of the ith agent in (7.6), i — I,- ■ ■ ,N. Let the 
control COt in (7.6) be as (7.16), where ,^P[ satisfies (7.18)-(7.21). If Jtf is dynamic, 
we assume that Assumption 5 holds. Then, Z,- is strictly passive from — CD, to Yf. The 
origin of(B,,z,6) is uniformly globally asymptotically stable. □ 

Proof. We demonstrate the passivity of Ei by considering the storage function in 
(7.12), whose time derivative along the trajectories of z is given by 






■Yfa). 



(7.25) 



Thus, the passivity from — 0), to Yj is established. 

We combine the results from Theorems 2.1 and 4.2 to prove the uniformly glob- 
ally asymptotical stability of (^,z, 0). We first note from Theorem 2.1 that the feed- 
forward path 5?\ is passive from (0 to u with the storage function 



Vn 



k=l 



(7.26) 



where P k (-) is as in (2.27)-(2.31). 
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We next show the passivity of 5?i- Let J 1 denote the set of indices i — 1, • • • ,7Y 
for which J4% is a dynamic block as (7.19)-(7.20). Using the storage function Vb{£,) 
in (2.35) and (7.21), we obtain 

%&)= E(-Wt&) + u ( CBt) (7.27) 

N 

< - E w '^>) + E *««* - E »*«*. ( 7 - 28 > 

iGJ 2 ' 1=1 i£J 

We further use the Lyapunov function 

V = V b + V p +V z (7.29) 

whose time derivative is given by 

N 

V = E (-zfzi - c,uj + Y, T co, + ufa>, + utCOi) - E ui(Oi - E Wife) (7.30) 

('=1 i£J ieJ< 

N 

= - E ^ - E "<•»<' - E W(6) + c tf) < °- ( 7 - 31 ) 

1=1 i£f ieJ 

Finally, since the static blocks satisfy (1.25), 

E urn = E Uih{ui) > 0. (7.32) 

We thus conclude that V is negative semidefinite and that the equilibrium (z,%,6) = 
is uniformly stable. 

To establish uniform asymptotic stability, we use the Nested Matrosov Theorem 
in Theorem B.5. Similarly to the proof of Theorem 4.2, we define an auxiliary func- 
tion 

V 2 = -0 T D + rco (7.33) 

where D + is the pseudo-inverse of the incidence matrix D and F is a diagonal matrix 
with entries 

r (V^o))- 1 ifieJ 

In particular L„./z,(0) := H L._ Q a/(0) is nonsingular and thus invertible because 
of the passivity of the ^-subsystems in (7.19)-(7.20) and because of Proposition B.l 
in Appendix B.4. To apply the Nested Matrosov Theorem, we denote by Y\ the right 
hand side of (7.31) and claim that 

Yi = ==> V 2 := Y 2 < 0. (7.35) 

To see this, we note that Y\ = implies that % = and a> = 0, which mean that all 
terms in V2 vanish except 



138 7 Synchronized Path Following 

-e T D + ra\ Yi=0 . (7.36) 

Because ft>/L = L ai hi(0)iii when i € ^ and c, = and because Y\ — implies 
m, = for i $_ J* or c,- > 0, we conclude from (7.34) that -TooL _„ = u and obtain 

-e T D + rd)\ Yi=0 = -e T D+u. (7.37) 

Since Y\ — implies z = 0, it follows from (7.17) and (7.23) that 5, = — m. Substi- 
tuting (7.11) in (7.37) and using the property DD + D = D, we conclude 

Yi=0=^Y 2 = -B T DD + Dy(6) = -¥y(e). (7.38) 

Since %(•) satisfies (2.46), 8 T \j/(9) is positive definite in 0. Thus, (7.38) proves 
the claim (7.35). It follows from (7.31) and (7.38) that Y 1 = and Y 2 = together 
imply (z, §, 0) =0. All assumptions of Theorem B.5 are satisfied and we conclude 
uniformly globally asymptotic stability of (z, t, , Q) = 0. □ 



7.4 Design Example 

7.4.1 Agent Dynamics 

Suppose that each agent is a fully actuated tugboat with three degrees of free- 
dom, where the surge mode is decoupled from the sway and yaw mode due to 
port/starboard symmetry. The dynamic model of agent i, i = 1, • • • ,7V, is given by 
[49]: 

f\i = RiVt (7.39) 

M i V i +D i (v i )Vi = X i (7.40) 

where r/,- = [x/,y,,i//,] T , (xi,yi) € M 2 is the position vector in the inertial frame E, 
i//,- is the heading angle (yaw), and V; = [v,-j, v,-,2, Vi,3] T € R 3 is the velocity vector 
in the body frame B. The model matrices M, and £>,(v,) denote inertia, Coriolis 
plus centrifugal and damping, respectively, while T, £ R 3 is the generalized control 
forces and moments in the body frame. The matrix R{ ESO(3) is the attitude matrix 
of the tugboat with respect to the inertial frame (see Fig. 7.3). Given Xjff, Rj can be 
written as 

(cos y/, — sin y/,- \ 
siny/; cosy/,- . (7.41) 

l) 

Recall that /?, satisfies RfRi — h, and 

R, = RiSfi = WiSRi (7.42) 




7.4 Design Example 
where 



The numerical values for (7.39)-(7.40) are taken from [60] as 

/ 33.8 1.0948 
M,= 1.0948 2.764 
\ 23.! 

and 

7 0.1 0" 

Q(V/)= ( 0.10.5 
2 
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(7.43) 



(7.44) 



(7.45) 



3'e 




V,.i: surge 



\ffj : ship heading 



V; 3 : yaw 



<*' (x,y): ship position 
>, 



XE 



Fig. 7.3 Inertial and body reference frames for a ship. 



7.4.2 Trajectory Generation 

A group of N agents will have N individual paths where the desired path for agent 
i is given by 7] d i(6{) = [xdi(9i) >ydi(6i) > Vdi(9i)] T ■ To create individual path for each 
agent, we introduce a Formation Reference Point (FRP) and design a set of des- 
ignation vectors /, <E M 3 , relative to the FRP, corresponding to the desired forma- 
tion for agent (', i = 1, • • • ,N (see Fig. 7.4). Let the desired path for the FRP be 
T} d {9) = [x d (G), ya(8), Wd{0)] T - Then agent i will follow the path 



T] di {8) = T] d {8)+R{0)l i 



(7.46) 
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where R(0) =R(\jf £ /(8)) denotes the rotation matrix from the frame T to the inertial 
frame E. In particular, the tangent vector along the path is chosen as the x-axis of 
the frame T, that is, xj = -2& and y? = -M-, The desired heading yf c i(0) can then 
calculated as the angle of the tangent vector in the inertial frame 



Wd(e) 



arctan 



arctan 



de 

dy d (8) 
. de . 



(7.47) 




Fig. 7.4 Formation setup using a Formation Reference Point and designation vectors. 



An individual agent substitutes its own 0j into (7.46) and obtains its desired path. 
When Qj's are synchronized, these agents are in the desired formation configuration. 



7.4.3 Preliminary Backstepping Design 

To apply to the results in Section 7.3 to (7.39)-(7.40), we first present a preliminary 
backstepping design [120] that brings agent dynamics to the form in (7.6). Towards 
this end, we define the error variables 



Zi.i = r]i-T]di(0i) 

Zl.i = V ( -Oy 
COj = v(t ) - 0; 



(7.48) 
(7.49) 
(7.50) 



where zt,\ is the tracking error of agent (' and a,,i is the virtual control to be deter- 
mined later. 
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Step 1. Differentiating (7.49), we obtain 

Zi,x = i] i -^d i (7.51) 

= Ri{Vi){ai+Zi,%) - ^9i. (7.52) 

OUj 



Consider a Lyapunov function 



V iA =z] l P i ,iZi,u (7.53) 



where Pn = Pj x > 0. The time derivative of Vi\ along the trajectory of (7.52) is 
given by 

V iA = 2^^! Ui{m)Oi+Hm)z^-^^\ ■ (7-54) 

We now take the virtual control a, as 

a,- = *f ( W ) U,i ZM + ^v(o) (7.55) 



where A,- 1 is chosen such that 



P iA A iA +Al l P iA =-h. (7.56) 



This leads to 



V u = -zf l z i .i+2zJ l Pi,iR i (Wi)z i .2 + 2zf l P l A^co i . (7.57) 

Step 2. In this step, we will design T, in (7.40). In aid of this, we differentiate a,- 
in (7.55) as 

* = 0i + 3?ft ( 7 - 58 > 

of, 

where <T, includes all the terms that do not contain Of. 

ai=Rf(\i/i)Ri(Yi)ck+Rf{Yd U^(^)v; + ^v(f)) (7.59) 



and 



— -/?,.( V ,,)[-A, 1 — + 



We next consider the Lyapunov function 



Vi,2=zl 2 Pi,2Zi,2, (7.61) 
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where P\^ = P[ 2 > 0, and define V = V,i + VJ-,2- The time derivative of V is then 
given by 

V = V Ll +2zi 2 P L2 (v l -a i ). (7.62) 

From (7.40) , (7.57) and (7.58), we rewrite V as 

V = -zJ l ZiA+2zJ l P l ,iR i {y i )z l ,2 + 2zf l Pi. l ^(o l 
t ( i , , n x das 



+ 2zl 2 Pi, 2 \Mr\%i-D l {Vi)Vi)-o i --^d i ) . (7.63) 

We then pick 

T i =D i (Vi)Vi + M i (A it 2Zi,2 + (Ti+^v(t)-Pr 2 1 Rj(Y i )P i! u ll ) 



where A,-.2 is such that 

With (7.64), V is now given by 



fyAw+ALfy = -h- (7.65) 



7 .7 i o / T n dT\di . T n ' 



V = -z^za - z{ 2 Zi,2 + 2 U^Pi,! -^ + z{ 2 Pi, 2 -^ j co,. (7.66) 

We further obtain the closed-loop system in the coordinate of (zi,i,Zi,2) as 

£)-u*ur)(s)-(:j)- -> 



F, 



Hi 



which is of the form (7.6). Choosing P = diag{Pj i i,P/ i 2}, we observe from (7.66) 
that 

PFi + F, T P=-I 6 (7.68) 

which satisfies (7.5). Thus, the preliminary feedback (7.64) brings (7.39)-(7.40) to 
the form (7.6). 



7.4.4 Adaptive Design to Estimate Reference Velocity 

We consider a group of four agents following circle- shaped paths. The desired path 
for agent i is given by 

/x di (8i)\ /^cos^)' 
ri di (di)= y di (6i) = rising) | (7.69) 

Vvto(ft)/ V T^o + f 
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where r ; - is the circling radius of agent i. In the simulation we set r\ = 8, rj = 12, 
r3 = 16 and n, = 20. This implies that after agreement of all path parameters, the 
vessels will move along different circles parallel to each other. 

We assume that only agent 1 has the information of v(t) in (7.6). In the simu- 
lation, we choose v(t) — 10. According to Remark 7.1, the other agents implement 
the adaptive design from Chapter 3 to estimate the v{i) information. Since v(t) is 
constant, this adaptive design takes the following form 

0, = Vi - (Oi (7.70) 

e 
$i = -1iE«kYfc(&), Yi>0- (7-71) 

The initial conditions for 77,'s are given by 771 (0) = [0 § ] r , 772(0) = [1 - 2 |] T , 
773(0) = [5 - 2 |] T and r/ 4 (0) = [0 - 5 - f } T . The initial velocity for each agent 
is zero. The communication topology is given by the incidence matrix 



D = 



( -1\ 

1 

-1 1 

\0 -1 1 J 



(7.72) 



which means that only agents 2 and 3, agents 3 and 4, and agent 4 and 1 can ex- 
change their path parameters. We set 6>i(0) = 94, 6*2(0) = -137 and 6*3(0) = 90. 
The initial conditions for v,'s are £2(0) = —8, £3(0) = —10, £4(0) = 9. In (7.9), we 
choose yfk(x) = x and M'i as a constant gain of 0.5. The control parameter P[ is set 
to Pi = diag{0.2,0.2, 1, 10, 10,40}. 

Fig. 7.5 shows that before we turn on the adaptation at 100 sec, the agreement 
of the path variables is not achieved since each agent has different v(t) informa- 
tion. After the adaptation is turned on, the path parameters are synchronized. Five 
snapshots of the formation are shown in Fig. 7.6. Initially, four agents are inside the 
smallest circle. Before we turn on the adaptation, the agents are not synchronized 
on their paths (agents 1 and 4 are ahead of agents 2 and 3). After the adaptation is 
turned on, the agents achieve the synchronized path following. 



7.4.5 Saturation in Thrust 

In this example, we assume that the propeller of agent 4 saturates and is only able 
to produce a surge sped less than the speed assignment. In this case, the saturation 
constraint of agent 4 will cause steady state errors in the agreement of the path 
variables. This error is eliminated by employing integral feedback from relative path 
variables between neighbors. In Design 2, the effect due to the thruster saturation 
can be reduced with a proportional-integral-derivative (PID) control structure with 
limited integral and derivative effect, also known as a lead-lag controller with the 
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Fig. 7.5 Before adaptation, the path parameters cannot be synchronized since each agent has dif- 
ferent v(f) information. After the adaptation is turned on, the synchronization errors converge to 
zero. 
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Fig. 7.6 Five snapshots of the formation: Initial formation is inside the smallest circle. Before we 
turn on the adaptation, the agents are not synchronized on their paths. After adaptation is turned 
on, synchronized path following is achieved. Agents 1, 2, 3 and 4 are denoted by x , o, o and □. 
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following transfer function: 



H p id,i{s) = v,-ft , flii rT77^ 
i + PiHiS l + ajT di s 



(7.73) 



in which v, > 0, < T dA < /i„ 1 < j3, < °°, and < a, < 1. Since (7.73) is Hurwitz 
and satisfies Re \Hpid,i{jG))\ > v, > for all ft), H p i d j(s) falls into the class of input 
strictly passive systems and stability of the interconnection follows from Theorem 
7.2. The control ft), is then given by 



GJrM = H pid,i{s)Ui{s) 



(7.74) 



We use Design 2 in Section 7.3.2 and compare the synchronization errors for 
constant //»w,-, i.e., 7]/ ; = ju,- = in (7.73) while v,- = 10, with the PID structure 
where T c u = 1, /1, = 2, v, = 1, a, =0.1 and j3,- = 10. The other parameters are the 
same as in Section 7.4.4. The synchronization errors 0,'s shown in Fig. 7.7 illustrate 
that the PID structure yields a better agreement of the path variables when agent 4 
saturates. 
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Fig. 7.7 The synchronization errors 0, for the PID and the constant gain designs. 



7.5 Summary 



We applied the passivity-based agreement design in Section 2.5 to a formation track- 
ing scheme where path following systems are synchronized by communicating path 
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variables. The passivation designs offers the flexibility to consider several designs 
for synchronization where robustness and performance issues can be addressed. The 
first design only employs the information of the synchronization error and becomes 
a trajectory tracking control as the path variables reach agreement. The second de- 
sign makes use of both path error and synchronization error and allows users to pri- 
oritize path convergence. We presented a design example that incorporated the basic 
adaptive design in Chapter 3 and that also explored the flexibility of the passivity- 
based design to reduce saturation effects. 



7.6 Notes 

• The path following results in this chapter are mainly based on [59]. Other tech- 
niques for path following of a single vehicle include [2, 40, 54, 1 14, 3, 45]. 

• The idea of incorporating synchronization/consensus schemes into formation con- 
trol has also been considered in other studies, e.g., [12, 127, 150]. 



Chapter 8 

Cooperative Load Transport 



8.1 Introduction 

For the examples presented in previous chapters, the relative information between 
agents, such as relative distances/positions, relative path variables, is obtained 
through explicit information flow, including sensor measurements and direct com- 
munication. While such explicit information flow exists in many cooperative control 
applications, there are situations where the information flow is implicit. For exam- 
ple, suppose that several people move a table and only one person knows where to 
go. Then, even without explicitly talking to or seeing each other, those people are 
able to adjust their velocities and forces, and finally succeed in moving the table to- 
wards the target. In this example, the communication is implicit, and people receive 
the information (e.g., where to go, how fast to go) by feeling the contact forces and 
the trend where the table is going. 

In this chapter, we consider a group of agents handling a flexible payload. These 
agents are modeled as point robots with double integrator dynamics. As the agents 
move, the payload may be squeezed or stretched, generating contact forces to the 
agents. The contact forces between the agents and the payload are modeled as gra- 
dients of nonlinear potentials that describe the deformations of the payload. Be- 
cause all the agents are attached to the payload, the contact forces can be consid- 
ered as implicit communication between the agents with the payload acting as the 
"medium". Our objective is to employ this implicit communication to design de- 
centralized control laws such that the contact forces are regulated at some setpoints 
and that the agents and the payload move with the same constant velocity in the 
limit. We assume that the deformations of the payload are so small that the motion 
of the payload can be approximated as a rigid body. This assumption is reasonable 
when a rigid load is surrounded with bumpers or elastic materials. Another illustra- 
tion of this assumption is multiple grippers grasping a rigid load, where the grippers 
possess compliance from installed flexible mechanisms. 

Recall that the formation control designs in Sections 2.6 and 2.7 employ virtual 
attractive/repulsive force feedback between the agents. For our load transport prob- 
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8, 




Fig. 8.1 Multiple agents are attached to a common flexible load. Initially the payload is unde- 
formed and agent i is attached to the point o, . If the load is deformed by agent ;', the position of 
agent i, Xu is different from a, and the deformation is approximated by 5, = x, — a,. 



lem, the contact forces play the role of physical force feedback between the agents 
and the payload. With this idea in mind, we consider the case where a constant refer- 
ence velocity is available to all the agents. We propose a decentralized controller that 
guarantees the force regulation and the velocity convergence of the agents and the 
payload. This controller, consisting of an internal velocity feedback and an external 
force feedback from the payload, exhibits a similar structure to the position-based 
formation control design in Section 2.6. Exploiting this similarity allows us to ap- 
ply adaptive design techniques to the load transport problem. For example, when 
no reference velocity is predesigned, we employ the adaptive design results from 
Chapter 3 and augment the decentralized controller with an integral control term. 
The resulting control law recovers the traditional integral force control and ensures 
the agents to achieve the same constant velocity and the force regulation without 
explicit communication. 



8.2 Problem Formulation 

Consider JV planar agents holding a common flexible load as shown in Fig. 8.1. Each 
agent is modeled as a point robot. Suppose that the load is initially undeformed and 
that agent i is attached to the load at the point a,-, i.e., x,(0) = a,(0) = x c (0) + r,-, 
where x c € R 2 and xi £ R 2 are the inertial positions for the center of mass of the 
load and agent (', and r, is a fixed vector in the inertial frame. Assuming that the 
initial orientation of the load Q c is zero, we define 

<t):=x c {t)+R(B c )r u W^ sin0 ; COS0 ;J (8.1) 
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whose kinematics are given by 

. • /-sin0 r -cos0 f \ 

a > =x < + e c{co S e c -sinej r <-- (8 - 2) 

Note that a,(f) represents the position where agent i is attached if the payload is 
undeformed at time t. As the agents move, however, the flexible payload may be 
squeezed or stretched and therefore x,(t) ^ a,(?). The deformation of the payload, 
approximated by 

8j =xi-ai, i=l,---,N, (8.3) 

generates a reaction force fi to agent i. This reaction force fi must be zero when the 
payload is not deformed by agent i, i.e., when 5{ = 0. Define the relative position 
between agent i and the center of mass of payload as 

Zi=Xi-x c , (8.4) 

and note from (8.1) that 

8i=Zi-R(d c )ri. (8.5) 

We next assume that the reaction force fi is the gradient of a positive definite and 
strictly convex potential function -P,(z,), that is, 

fi = VPi(zi) (8.6) 

where Pi(zi) satisfies 

P i ( Zi )=0^^ Zi -R(8 c )r i = (8.7) 

VPi(zi) = ^ Zi-R{8 c )n = 0. (8.8) 

The strict convexity assumption is satisfied by the linear spring potential model 
Pi(8i) — bj\zi — R(9 c )n\ 2 , b[ > 0, and certain classes of nonlinear models, such as 
Pi(S i ) = b l \z i -R(e c )r i \ 4 . 

We further assume that the deformations are small enough so that the dynamics 
of the payload can be approximated as a rigid body. This assumption is reasonable 
when a rigid object is surrounded by deformable materials (e.g., bumper) and the 
agents are attached to those materials. The dynamics of the agents and the payload, 
restricted to purely translational motion, are given by 

mm = Fi-fi, i=l,...,N (8.9) 

N 

M c x c = Y<f' ( 8 - 10 > 

i=\ 

where m, and M c are the masses of agent i and the load, Fi is the force applied to 
agent i, and /, is the contact force defined in (8.6). 

The control objective is to design Fi in a decentralized way such that all the agents 
and the payload converge to the same constant velocity, while the contact forces on 
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the load are regulated, i.e., f maintained at a setpoint ff. Because the load must 
move with a constant speed, ff's are subject to the following constraint: 

N 

The setpoints ff should also be chosen to satisfy desired properties, such as force 
closure [94], which ensures that the agents always squeeze the pay load at the desired 
stage. This requires the knowledge of the payload geometry and the grasping points. 
In the following sections, we consider the control design in two cases: first, a 
reference velocity v is predesigned and available to each agent; second, v is not 
available to each agent. The second case includes as a special case the situation 
where v is available only to the leader. 



8.3 Decentralized Control With Reference Velocity 

We note from (8.6) that the reaction force /, depends on the relative position z,. If z; 
can be regulated to some desired state, /; would also be maintained accordingly. To 
this end, we assume that for a given ff, there exists a deformation zf , such that 

ff = VPi{zi). (8.12) 

Note that achieving a desired contact force ff is now equivalent to driving the rela- 
tive position Zi in (8.4) to the desired one zf- Let 

t, c = x c - v, B,t = xi - v, (8.13) 

and 

^[tfV-.&f, z=[z\,---M T , z d = [(4) T ,-,(4 / ) T ) T - (8-14) 

Our control objective is thus convergence to the equilibrium S p 

S p = { (§,&,*) | § = 0, %o = 0, and z = z d ) . (8.15) 

This equilibrium is the same as that of the position-based formation control prob- 
lem in Section 2.6, where the relative positions between agents are driven to some 
desired values. Indeed, if the payload is treated as the A^+ 1th agent, convergence to 
(8.15) means that zt, the relative position between agents i and the payload, is driven 
to the desired value zf- Thus, the interactions between the N +1 agents display a 
star graph with the payload at the center, as shown in Fig. 8.2. If the contact forces 
(and thus the relative positions) between agent i, i= 1 , • • • , iV, and the payload can 
be regulated, the relative positions between the N agents are maintained tightly. 
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To achieve the convergence to (8.15), we propose the following control law for 
agent i 

F i = -K i (x i -v)+ff, K,=Kf>0. (8.16) 

This feedback law transforms the agent dynamics (8.9) to the form 

m i x i = -K i (x i -v)+ff-f i (8.17) 

which consists of an internal motion feedback that drives the agent's velocity to v, 
and an external force feedback that regulates the contact force. We further rewrite 
(8.17) and (8.10) as 

x t = & + v (8.18) 

m4t = -K£ i + u i (8.19) 

x c = ^ + v (8.20) 

M c % c = u c (8.21) 



where 
and 



ff-fi (8.22) 



TV 

u c = — ^Uj. (8.23) 



Note that the agent dynamics (8.18) and (8.19) are of the same form as the position- 
based formation control in (2.19) and (2.20). In particular, the external feedback u\ 
in (8.22) is the gradient of the potential function 

Pi{li) = P,(z,)-P,(zf) - (ff) T (z, -*?). (8-24) 

Thanks to the strict convexity of Pi(zi), Pi(zi) is positive definite as proven in Lemma 
8.1 below and thus plays a similar role to Pk{zk) in (2.27). 

Lemma 8.1. The potential function (8.24) has a unique global minimum at zt = zf' 
and is proper. □ 

Proof. A direct application of Proposition B.2 in Appendix B.8. □ 

The payload dynamics (8.20)-(8.21) are almost the same as (8.18)-(8.19), except 
that (8.21) is only passive from u c to £, c whereas (8.19) is strictly passive from 
m, to §,-. This is because the payload dynamics (8.10) have no damping. If (8.21) 
were strictly passive, we could follow the results in Corollary 2.2 and prove the 
convergence of i, and x c to v and Zi to zf , which means f —* ff. As demonstrated 
in the following proposition, these convergence results still remain true when (8.21) 
is only passive. The proof for this proposition, given in Appendix A.5, relies on the 
implicit communication topology in Fig. 8.2 and the fact that v is constant. 
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agent 1 



(\ payload 



agent 2 ( ) ( ) agent 3 



Fig. 8.2 The implicit information topology between three agents and the payload displays a star 
graph. 



Proposition 8.1. Consider the decentralized control law (8.16) with the dynamics 
(8.9) and (8.10), where ft is defined in (8.6). Then, the equilibrium defined in (8.15) 
is globally asymptotically stable. In particular, /,■ — > ff, i — 1, • • ■ ,N. O 



8.4 Decentralized Control Without Reference Velocity 

In some situations, the reference velocity information may not be available to the 
agents. For example, the agents may have a common velocity direction but have 
not agreed on how fast to go. In this case, it is important for the agents to reach 
a common velocity. To achieve this, we follow the results from Section 3.6 and 
develop an adaptive control with which each agent estimates the group velocity. We 
define v; as the velocity estimate for agent i and propose the following update law 
for vf. 

h=Hff~fi) (8-25) 

in which A,- = Af > 0. Note that v,- stops updating when ff = /,, that is, the contact 
force is regulated at the desired setpoint. This means that z, remains constant and 
thus agent i and the payload have the same velocity. If all the agents have the same 
velocity as the payload, they move with the same velocity. 
We next modify the design in (8.16) as 

Ft = -K,(x, - Vi)+mji + ff. (8.26) 

With a slight abuse of notation, we let 

$i=Xi-Vi (8.27) 

and transform (8.9) and (8.26) to 

Xi = $ i + v i (8.28) 

m& = -K& + ff-f i . (8-29) 

Define 
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e=[(vi) r ,---,(%)T, (8-30) 

and note that the equilibria set of (8.10), (8.28) and (8.29) is given by 



£\ 



'; = { (§,* c ,z,v) | § = 0, v- l N ®x c = and z - z f/ } . (8.31) 

Convergence to <?* means that all the agents maintain the desired contact forces on 
the payload while reaching the same velocity as the payload. This agreed velocity 
depends on the initial condition of v and the initial velocity of the payload and is 
characterized in the following proposition: 

Proposition 8.2. Consider the decentralized control laws in (8.25) and (8.26). The 
trajectories of (%,x c ,Z,v) remains bounded and converge to St in (8.31). In par- 
ticular, x c and Xj 's converge to v £ Mr, which is the weighted average of the initial 
payload velocity x c (0) and the initial velocity estimates v;(0), i= 1, ■ ■ ■ ,N: 

N N 

v=(EA r ' +M c r 1 (M c .i c (0) + £A.- 1 v/(0)). (8 .32) 

i=i i=l 



A special example of the design (8.25)-(8.26) is when only one agent, say agent 
1, has the v information. In this case, agent 1 can choose to turn off the estimation 
(8.25) by selecting A\ = and letting v\ (0) = v. This leads to the same controller 
in (8.16) for agent 1. A simple calculation from (8.32) shows liin^^o v = v, which 
means that the other agents asymptotically recover the v information and the group 
will eventually move with the velocity v. 

Proposition 8.3. Suppose that agent 1 has the v information and implements (8.16) 
while the other agents apply the control (8.25) and (8.26). Let 

vi=h0, vi = vi-v, i = 2,---,N (8.33) 

and v — [(vi ) T , • • • , (v w ) r ] r . Then the equilibrium 

SI = { (§, §.,i?, z ) | § = 0, & = 0, v = and z = z d ) (8.34) 

is globally asymptotically stable. □ 

The proofs for Propositions 8.2 and 8.3 are given in Appendix A. 6 and A.7, respec- 
tively. 
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Fig. 8.3 Experiment testbed of two PUMA 560 arms: Mounted on the wrist of each arm is a 
six-degree freedom force/torque sensor. 



8.5 Experiments 
8.5.1 Hardware 

The experimental testbed, shown in Fig. 8.3, consists of two PUMA 560 arms, each 
with a six-degree of freedom force/torque sensor mounted on the wrist. Two com- 
puters running xPC target perform the real time control of the two arms and the data 
acquisition is achieved using PCI interface boards. Those two xPC computers run 
at an update rate of 1 kHz and only do low level tasks. There is a user interface 
host computer that runs MATLAB and communicates with the control computers 
through an Ethernet cable using UDP This configuration allows high level pro- 
cessing and control to be done on the host computer while the control computer 
implements the low level control loop. 



8.5.2 Implementation 

Because the end-effector of the PUMA arm is of six-degree freedom, we consider it 
as a fully actuated agent. To simplify the implementation and reduce the effects due 
to the uncertainty of the arm inertia, we choose to implement the control laws on 
the kinematic level rather than the dynamic level. Motivated by a standard singular 
perturbation analysis of (8.17) for small m,-, we obtain the following controller by 
setting the right hand side of (8.17) to zero (i.e., setting x,- = 0) and solving for i,-: 

*t = r t (tf-f { )+v, /=1,2 (8.35) 

where i7 > 0, x\ and X2 are the positions of the end-effectors of the right and the left 
PUMA arms. This controller can also be justified by treating the payload as a vir- 
tual massless spring and invoking a Lyapunov analysis of (8.35) with the Lyapunov 
function £? =1 Pi(zi). Likewise, for the adaptive design, we implement 
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Fig. 8.4 The first two stages of the experiments (pictures were taken from the top view): (a) The 
two manipulators first move towards the ball, (b) Once the end-effectors reach the ball, we imple- 
ment the control (8.35) with v = [0 0] r m/s to achieve the static holding. The left (right) PUMA 
refers to the left (right) manipulator in this figure. 



x i =r i (ff-f i )+v i 



(8.36) 



and keep the update law of v, the same as (8.25). 

Once the force measurement fi is available, i, is computed from (8.35) or (8.36) 
and transformed to the joint velocities using the pseudoinverse of the manipulator 
Jacobian matrix, i.e., 

qt = J+Xi (8.37) 

where qi is the joint velocity, and / ( - is the pseudoinverse of /, that satisfies 

Xi=J, qi . (8.38) 



The joint positions for the next step are calculated from g, as 

qi{k+\) =qj(k) + Tqi 



(8.39) 



where T — 0.001 sec is the sampling period. The next step joint position is then 
tracked by a low level PID controller. 

Our experiments are performed in the following steps: 

Approaching. As seen in Fig. 8.4(a), the two manipulators approach a light 
weight soccer ball, fed by a person, along the y direction. In this stage, we tune 
the positions and the orientations of the end-effectors so that they are aligned on the 
same line. Thus, their motions are restricted to the same plane. We also ensure that 
the end-effectors are normal to the contact surfaces and that the line connecting both 
end-effectors approximately passes through the center of the ball. This guarantees 
no rotational motion when the manipulators squeeze the ball in the next stage. 

Static Holding. As the end-effectors reach the ball in Fig. 8.4(b), we turn on 
the controllers in (8.35) with v — [0 0] r m/s, which implies that the ball will be 
held statically and squeezed. The squeeze forces are along the y direction and their 
desired setpoints are chosen to be ±10 N, that is jf = [0 - 10] r N and /| = [0 10] r 
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N, thereby satisfying (8. 11). These large desired squeeze forces help maintaining the 
grasp and reducing the rotational motion of the ball. In case that the squeeze force 
drops below a certain amount (6 N in the experiments), we increase the feedback 
gain I] to quickly drive /; back. To ensure that the end-effector does not slip on 
the contact surface, we discard the force measurements along the x direction, which 
means that only the squeeze forces are controlled. Once the squeeze forces reach 
the desired setpoints, we start to move the ball with several basic maneuvers as 
discussed below and evaluate the performance of the proposed controllers in these 
cases. 

Moving the object. In this step, we examine the proposed controllers in the sce- 
narios of moving with v available, estimation of v, and circular motion. In all cases, 
we setI7 = r = 0.001. 



8.5.2.1 Moving with v available 

After the stage of static holding, we move the object along the x direction by as- 
signing v = [0.02 0] r m/s to each end-effector. The force measurements from the 
sensors on the wrists are shown in Fig. 8.5. The squeeze forces are maintained at 
±10 N at the stage of static holding (17.5 sec-30 sec) and oscillate more around 
±10 N when the end-effectors start moving (30 sec-47.5 sec). This is partially due 
to the dynamic effects of the low level PID tracking controller that we ignored in the 
implementation. The trajectories of both end-effectors from 30 sec to 47.5 sec are 
shown in Fig. 8.6, where no significant rotational motion is observed. Moreover, we 
compute from Fig. 8.6 that the approximate average velocities along the x direction 
is 0.02 m/s, the same as v. 



8.5.2.2 Adaptive Estimation of v 

We now examine the adaptive design (8.36). We assign v = — [0 0.005] r m/s to the 
right PUMA while the left PUMA has no v information. Therefore, the left PUMA 
needs to implement the adaptive design (8.36) to estimate v. The initial estimate of 
the left PUMA vi (0) is chosen as zero. Since the motion is along y direction, which 
is the squeeze direction, no rotational motion is generated. The experimental results 
are shown in Fig. 8.7, where the estimate from the left PUMA converges to —0.005 
m/s meanwhile the squeeze forces are well maintained at ±10 N. 

As a comparison, we implement the nonadaptive control law (8.35) with v = 
-[0 0.005] T m/s for the right PUMA and v = [0 0] T m/s for the left PUMA. Since 
they do not have the same v information, the experimental results in Fig. 8.8 show 
the existence of steady state errors in both the reference velocity tracking and the 
force regulation when the end-effectors are moving (after 30 sec). These results are 
expected from the analysis in Section 3.2. In fact, the two end-effectors, along the y 
direction, are governed by 
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Fig. 8.5 The force measurements along the squeeze direction when the manipulators move in the 
x direction. 
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Fig. 8.6 The trajectories for both end-effectors when they are moving forward with v = [0.02 0] r 
m/s. 



x hy = r(f( y - f Ly ) - 0.005, i 2 , v = r(f( y - f 2 . y ) + o (8.40) 

where the subscript y denotes the y component of each vector. Rewriting (8.40) as 
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*i,v = n/f,v - ^ -h.y) - 0-0025 (8.41) 



i 2 ,v = r(fl y + °^ -f 2j ) - 0.0025, (8.42) 



■'2,v 

we see that due to the different reference velocities, the desired setpoints ff are 
shifted to new setpoints ff . It is easy to calculate ff as ±12.5 N, which matches 
our results in Fig. 8.8(b). 



8.5.2.3 Circular Motion 

Although the nonadaptive controller (8.35) is restricted to the case of constant ref- 
erence velocity, we test it when the end-effectors move in a circular motion. In this 
case, we choose ff = [0 — 15] 7 " N and ff = [0 15] r N and show both the force 
measurements and the trajectories of the two end-effectors in Fig. 8.9. Fig. 8.9(a) 
illustrates that once the end-effectors start circling (after 40 sec), the squeeze forces 
cannot be maintained at ±15 N due to the periodic motion. The tracking of the ref- 
erence velocity is not well achieved as the trajectories in Fig. 8.9(b) are not a perfect 
circle. This is because the force and the motion control are coupled all the time. 

To improve the tracking performance, we add deadzone for the force error ff — f 
so that the force and motion control are decoupled when /, is close to ff. The 
experimental results in Fig. 8.10 illustrate the improvement of the tracking of the 
circles at the cost of more fluctuating contact forces in the deadzones. 



8.6 Summary 

In this chapter, we studied a motion coordination problem where a group of agents 
move a flexible payload. The contact forces, which describe the relative information 
between the agents and the payload, build up implicit communication in the group. 
When the desired constant velocity is available to each agent, we developed a de- 
centralized controller that achieves the convergence to the reference velocity and 
the force regulation. We also considered the situation where the reference velocity 
is not available. We proposed an adaptive control that recovers the nonadaptive re- 
sults. Both nonadaptive and adaptive control laws were compared to the formation 
control designs in Chapters 2 and 3. Experiments were performed and validated our 
designs. 
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8.7 Notes 

• There exists a considerable amount of literature on the control design for the load 
transport problem, including the motion/force control [89, 145], event-based control 
scheme [92], caging without force control [106], leader/follower comply mechanism 
[154, 57]. Reference [130] employed screw theory to examine a system of two non- 
holonomic wheeled mobile manipulators holding a common load. In [129] multi- 
gripper grasping was considered, where one of the grippers is rigid and the others 
are flexible with built-in linear springs, and stabilization control laws were devel- 
oped to achieve both position and force control. The authors of [128] considered 
transporting large (possibly flexible) objects as an impedance control problem and 
performed experiments using multiple mobile robots with manipulators to validate 
their controllers. The work in [134] studied modeling and manipulating totally de- 
formable objects and the solution is centralized and based on finite element model. 
Our model of the pay load is different from [134] since we assume the payload to be 
a partially flexible object that deforms only around the grasping points while [134] 
considered the deformations of all particles on the flexible object. 

• In contrast with the existing literature, we solved the load transport problem in 
a similar way to the position-based formation control problem. The goal of this 
chapter is to address in the same passivity-based framework the connection between 
the formation control problem, where the interaction force is virtual, with the multi- 
agent load carrying problem, where the interaction force is physical. 

• Expanding the dynamics (8.9) with the adaptive control law (8.25) and (8.26), we 
obtain 



m l x l = -K I (x,-v' d (0))+m i v i +f?-f i + K t Ai I (jf-fi) (8.43) 



integral force control 

which is of the integral force control form [89, 145]. Such an integral force control 
has been shown in [145] to be robust with respect to small time delay in the force 
measurements. 

• We have considered only translational motion of the agents and the payload in this 
chapter. This is a simplifying assumption, which allows us to illustrate in a basic 
form the connection between the formation control problem and the load transport 
problem. Indeed, if the grasp is rigid, the agents are capable to exert torques to the 
payload. Then the dynamics for the orientation of the payload become 

N N 

Ic8c = Y, r ic X fi + L r i < 8 - 44 ) 

i=l (=1 

where I c is the inertia of the payload, T, is the torque transmitted to the payload from 
agent i, r, c = xi — x c , and 

ric x fi = rf c ff-riff, (8.45) 
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in which r ic = [rf c ,rf c ] T and fi = [fj c ,fJ] T - Our assumption on pure translational 
motion means that T,'s may be chosen to stabilize 9 C to a constant. The design of 
T; may require the information of r, c and C . Once 9 C is stabilized, the remaining 
motion would be only translational. Therefore, the formulation in this chapter only 
reflects the translational part of the load transport problem. 
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Fig. 8.7 Estimation of v: The estimate of v converges to its true value in Fig. 8.7(a) while the 
squeeze force measurements are well-maintained at ±10 N as in Fig. 8.7(b). 
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Fig. 8.8 When the end-effectors have different v information, the tracking of the reference velocity 
is not achieved as in Fig. 8.8(a) and the contact forces cannot be maintained at the desired ones as 
in Fig. 8.8(b). 
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Fig. 8.9 The squeeze forces cannot be maintained at the desired value due to the circular motion 
of the end-effectors. The trajectories of the end-effectors in Fig. 8.9(b) show that the tracking of 
the reference velocity is not well achieved. 
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Fig. 8.10 With deadzones added to the force control, the tracking of the reference velocity is 
improved as in Fig. 8.10(b) while the squeeze forces are more fluctuating in Fig. 8.10(a). 



Chapter 9 

Caveats for Robustness 



9.1 Introduction 

In this chapter we consider the position-based formation control design in (2.74) 
and investigate its robustness with respect to switching topology, link gain variation 
and unmodeled dynamics. For convenience, we rewrite (2.74) here 

{M®I p )x+(K®I p )x+(L A ®I p )x = (9.1) 

where M = diagjmi , • • • , nips}, K — diag{&i , • • • , Icn} and L& = DAD T is the weighted 
Laplacian. Recall that (9.1) ensures global asymptotic stability of the origin of x and 
z= (D T ®I p )x. 

We first analyze (9.1) with switching topologies. Such switching may occur due 
to the vehicles joining or leaving a formation, transmitter/receiver failures, limited 
communication/sensor range, or physical obstacles temporarily blocking sensing 
between vehicles. For single integrator dynamics, switching topology has been stud- 
ied in [63, 103] and stability under arbitrary switching has been ascertained for 
classes of coordination algorithms. In contrast, for second order dynamics, we illus- 
trate with an example that a destabilizing switching sequence that triggers instability 
exists. We then show that stability is maintained when switching is sufficiently fast 
or slow so that is does not interfere with the natural frequencies of the group dy- 
namics. 

We next investigate stability properties when the link weights are perturbed by 
small sinusoidal oscillations. To illustrate this instability in its most basic form, 
we make a simplifying assumption that the perturbation is sinusoidal and trans- 
form the group dynamics into a form that reveals a parametric resonance mecha- 
nism [52, 96, 53]. This transformation employs the spectral properties of the graph 
Laplacian and decouples the relative motion from the motion of the center of the 
agents. When mass inertia and damping terms are identical for all agents, we obtain 
decoupled Mathieu equations [96], which make parametric resonance explicit. For 
broader classes of mass and damping matrices, we obtain coupled Mathieu equa- 
tions and discuss which frequencies lead to parametric resonance. Next, we show 
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that sinusoidal perturbations do not destabilize the system if they are slow or fast 
enough. The sinusoidal perturbations studied in this situation are not necessarily the 
most commonly occurring ones in practice. However, they allow us to study worst- 
case scenarios to deepen the understanding of fundamental stability and robustness 
properties in cooperative systems. 

We finally study the effect of input unmodeled dynamics, such as fast actuator 
dynamics. Following standard singular perturbation arguments, we prove that the 
stability of the nominal design that ignores the effects of unmodeled dynamics is 
preserved when the stable unmodeled dynamics are sufficiently fast. As we illustrate 
with an example, how fast the unmodeled dynamics must be is dictated by the graph 
structure and the mass inertia matrix. 



9.2 Instability due to Switching Topology 
9.2.1 Example 

Consider four agents with an undirected graph that switches between a ring graph 
and a complete graph' . Let M = I, K = kl and A = 8 1 for some constants k > and 
8 > 0. Then, the closed-loop dynamics (9.1) become 

Z + k*+8(Li®Ip)x = /=1,2 (9.2) 

where L\ = DjDf is the Laplacian matrix for the ring graph when / = 1 , and for the 
complete graph when (' = 2. 

Because L\ and L-i admit the same set of orthonormal eigenvectors qj, j = 1 , • • • , 4 
for their eigenvalues {0, 2, 2, 4} and {0,4,4,4}, respectively, the change of variables 
dj = (q T ; ®I p )x, j = 1, • • • ,4 decouples the dynamics (9.2) into 

d J + kd J + 8X} i dj = 0, (9.3) 

where Xj t is the yth eigenvalue of the Laplacian L,-, i = 1,2. It then follows from 
standard results in switching systems [81, 1, 80] that, if the damping k is small, and if 
8Xj [ < 1 and 8Xj 2 > 1, then (9.3) is destabilized by a switching sequence that selects 
i = 1 when djdj > and i = 2 otherwise. Instability with this sequence follows from 
the Lyapunov-like function V = ||d/|| 2 + ||dv|| 2 which increases along the trajectories 
of (9.3). Because the eigenvalues X^ and Xj lj switch between the values 2 and 4 in 
our example, if 8 £ (1/4,1 /2) , then 8Xj t < 1 and 8Xj 2 > 1 indeed hold for j = 2,3. 
This means that, when the damping is small, a destabilizing switching sequence 
exists. 

We demonstrate this instability with a simulation in Fig. 9.1. We choose p = 1 
and four agents. Although the system (9.1) guarantees agreement of x,'s for any 

A complete graph is a graph where every two nodes are connected. 



9.2 Instability due to Switching Topology 



167 



fixed connected graph, when the communication topology switches between a com- 
plete graph and a ring graph according to the sequence described above, Fig. 9.1 
shows that the relative distances between the agents diverge. 




Fig. 9.1 A switching sequence described in Section 9.2.1 between the ring and complete graphs 
destabilizes the relative positions between the agents in the system (9. 1). 



9.2.2 Comparison with First-order Agent Models 



The instability example presented in the previous section occurs only when the agent 
dynamics are second or higher order. In this section, we show that for agents mod- 
eled as first order integrators, switching between connected graphs will not lead to 
instability. In fact, the agreement of x,'s can be achieved even if the graph loses 
connectivity pointwise in time. Note that for first order agents, the Jffj's in Fig. 2.2 
are simply static passive blocks. We then restrict our attention to the following class 
of first order agreement protocols 



M N 

' L d 'k ( ? ) V* ( z k ) *k ■ = Y, d Jk ( f ) X J ' 

k=\ ./=! 



(9.4) 



rewritten in vector form (2.3), (2.24) as 
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x = -(D{t)®I p )y(z) (9.5) 

z=(D(t) T ®I p )x (9.6) 

where the multivariable nonlinearity \ff(-) has the property (2.46). The matrix D(t) is 
piecewise continuous because its entries exhibit step changes when a change occurs 
in the communication topology. We define the time varying graph Laplacian as 

L(t) = D(t)D(tf . (9.7) 

If the graph remains connected for all t > 0, that is, if 

fa{L{t)}>o c >0 V?>0 (9.8) 

for some constant a c > that does not depend on time, then it is not difficult to show 
that x,-'s in (9.5)-(9.6) reach an agreement despite the time-varying L(t). We now 
prove agreement under a less restrictive persistency of excitation condition which 
stipulates that graph connectivity be established over a period of time, rather than 
pointwise in time: 

Proposition 9.1. Consider the system (9.5)-(9.6) where x € W N comprises of the 
components x,- G W, i = I,- ■ ■ ,N concatenated as in (2.3), \ff(-) satisfies (2.46), and 
D(t) is piecewise continuous incidence matrix. Let S be an (N — I) x N matrix with 
orthonormal rows that are each orthogonal to Ijv; that is, 

S\ N = Q SS T =I N - l . (9.9) 

If there exist constants 8 > and a > such that, for all to > 0, 

fto+8 



f° SL{t)S T dt>af (9.10) 

Jtn 



where L(t) is defined in (9.7), then the protocol (9.5)-(9.6) achieves the agreement 
ofxj's. □ 

The proof of this proposition can be found in Appendix A. 8. The "persistency of 
excitation" condition (9.10) means that SL(t)S T is nonsingular when integrated over 
a period of time, and not necessarily pointwise in time. Since, by construction of S 
in (9.9), SL(t)S T inherits all eigenvalues of L(t) except the one at zero, its smallest 
eigenvalue is 

h{SL{t)S T } = k 2 {L(t)}, (9.11) 

which means that nonsingularity of the matrix SL(t)S T is equivalent to connectivity 
of the graph. Because Proposition 9.1 does not require nonsingularity of SL(t)S T 
pointwise in time, it allows the graph to lose pointwise connectivity as long as it is 
established in the integral sense of (9.10). The pointwise connectivity situation (9.8) 
is a special case of Proposition 9.1 because, then, (9.10) readily holds with a = G c 8. 
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9.2.3 When is Stability Maintained? 

Having demonstrated stability robustness of first order agreement protocol with re- 
spect to switching topology, we now come back to the second order protocol (9.1) 
and consider under what conditions the stability of (9.1) is maintained. 

Since (9.1) guarantees the origin of (x,z) to be exponentially stable for a fixed 
connected graph, using the concept of dwell-time [81, 91, 56], we can ensure x — > 
and z — > if all graphs in the switching sequence are connected and if the interval 
between consecutive switchings is no shorter than some minimum dwell time x > 0, 
where estimates for x can be obtained following [56]. We next employ the concept of 
an "average graph" to show that fast and periodic switching also preserves stability. 

Consider a periodic switching sequence a(t) in which the topology switches 
n—\ times, n>\, during one period T. We label n graph Laplacians in T as V A , 
i = 1, • • • ,n and denote their dwell times by T,, i = 1, • • • ,n, £" =1 Xi=T. We thus 
study the switched system: 

(M®I p )x+ (K®I p )x + {L a A (t) I p )x = (9.12) 

where 

I* W e{Li,4,...,L5}. (9.13) 

To determine the stability of (9.12)-(9.13), we investigate the eigenvalues of the 
state transition matrix evaluated over a period T: 

S(r,0)=e A ' vT ' v ---e A2T2 e AlTl , (9.14) 

where 

is the system matrix of (9.12) in the coordinates of (x,x), i — 1, ■ ■ ■ ,N. When T,'s 
are small, we rewrite (9.14) as 

S(T,0)=f\[I+x i A i + O(xf)] 

i=\ 

N 



= /+^T ; A, + o(r 2 ) 



where 



and 



I + TA m ' + 0{T 2 ) (9.16) 



' r '=l -M% -M-^K) '» 



L^ = I£t,4 (9.18) 

1 i=i 
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is the average of the n graph Laplacians during the period T. 

Because the linear combination (9.18) preserves the structure of a Laplacian, 
L™ defines an average graph obtained by superimposing the individual graphs ;' = 
1, ■■• ,n. In this average graph, the links are weighted by XijT , which represents the 
relative dwell time of each graph constituting the average. This means that, if the 
time- varying graph is jointly connected as in [63], then the averaged graph described 
by Z/f is connected. We point out that the connectedness of If? also satisfies the 
persistency of excitation condition in (9.10) with 8 = T since for all to > 

SL a A (t) S T dt = SL l £S T , (9.19) 

'0 

which is positive definite if and only if the average graph is connected. 

We finally show that, when T is sufficient small, connectedness of the average 
graph implies stability of (9.12)-(9.13). To see this, note from (9.16) that the eigen- 
values of 3 (T, 0) are given by 

Ki = l + TXi + 0{T 2 ), j=l,...,2W, (9.20) 

where A,'s are the eigenvalues of A"''. It follows that if the graph induced by the 
averaged Laplacian If? is connected, then all A,'s have negative real parts, except 
the one, say X\, at zero. This zero eigenvalue results from the null space of A" v , 
spanned by a = [ljy 0jj) T , which is also the null space of A,, i = 1, • • • ,n. We thus 
conclude that E(T,0)a — a, which implies Ki = 1. Then, for sufficiently small T, Kj 
in (9.20), i = 2, ■ ■ ■ , 2N, remain inside the unit circle and K\ — 1 corresponds to the 
motion of the center, thereby guaranteeing the asymptotic stability of the subspace 
spanned by a — [ljy 0^] r . Note that convergence to this subspace guarantees x — > 
and z — > 0. 

Lemma 9.1. Consider the closed loop dynamics (9.12)-(9.13) with a switching sig- 
nal a{t) of period T. If the averaged graph induced by (9.18) is connected, then 
there exists a T* , such that for T < T*, the the subspace spanned by a= [1^ 0jf] T 
is asymptotically stable. D 



9.3 Parametric Resonance 
9.3.1 Example 

To illustrate parametric resonance in its most basic form, we study an example of 
the cooperative system (9.1) with M = I, K = ki and A = SI. To further simplify the 
notation we consider the single degree-of- freedom case p = 1. The same analysis 
extends to p > 1 with the use of Kronecker algebra. The graph is now time-invariant 
but the link gain 5 is perturbed by a cosine term e cos cot, thus leading to the closed- 
loop model 
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x + kx+(8 + ecoscot)Lx = 0. (9.21) 

Note from Property 1.2 that L can be diagonalized by an orthonormal matrix Q: 

Q T LQ = L d := diag{A w , • • • , X x } (9.22) 

where Xn > An_i > ••• > Ai — 0. If follows from Property 1.3 that if the graph is 

connected, then only X\ is zero and the corresponding column in Q is —= In due to 

VN 
Property 1.1. Thus, we let 

Q=[S T ^-\ N ] (9.23) 

where S satisfies (9.9), and decompose x as 

x = S T d+^c, (9.24) 



where d e M^" 1 and eel. 

The dynamics of c correspond to the evolution of the center of x and is obtained 
by premultiplying (9.21) by -j^ 1 N : 

c + kc = 0. (9.25) 

The solution c(t) approaches c(0)/k + c(0), which means that the time-varying link 
gains do not affect the motion of the center. 

Next we derive the dynamic equations for d. Since SS T — In- i , we obtain from 
(9.24) that 

d = Sx (9.26) 

which, from (9.21), results in 

d + kd + (5 + £ cos 0)t)SLx = 0. (9.27) 

We further note from (9.24) that 

SLx = SLS T d (9.28) 

and from (9.22)-(9.23) that 

SLS T = l d (9.29) 

where L d = diagjA^, • • • , A 2 }. Substituting (9.28)-(9.29) into (9.27), we obtain 

d) + kdj + (8 + e cos (Qt)Xn + i-jdj = 0, j=l,---,N-l, (9.30) 



which is a Mathieu equation [52, 149, 96] with the natural frequency ^/5A^ + i_ ; -. It 
then follows from standard results for the Mathieu equation that instability occurs 
when the frequency of the perturbation is around co = 2\/5A,-/r, r = 1,2,3,- • •, for 
each i = 2, • • • ,7V. When damping k is zero, parametric resonance occurs at these 
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frequencies for arbitrarily small £. For nonzero damping k, parametric resonance 
occurs for sufficiently large values of e. 



9.3.2 Coupled Mathieu Equations 

In the previous example, the assumptions that M = I and K = kl played a crucial 
role in obtaining the decoupled Mathieu equations (9.30). We now remove this as- 
sumption and study the case where M, K and A in (9.1) are diagonal matrices with 
not necessarily identical entries. We then reveal parametric resonance with an anal- 
ysis of coupled Mathieu equations as in [96, Section 5.4], [149, 53, 52]. When each 
link gain Si is perturbed by e5, cos cot, (9. 1) becomes 

Mx + Kx + D(A+ecosmA)D T x = (9.31) 

where A = diag{5i, • • • , 8f\. Premultiplying by the inverse of M, we obtain 

x + M~ l Kii. + M- l L A -s. + EcoscotM- l L A x = 0. (9.32) 

where L A = DAD T . The coordinate transformation y — ,'7~ l x, where S? is com- 
posed of the eigenvectors of M~ l L&, then leads to 

y + £r- 1 M~ 1 K£Ty+Ay + ecosa)t^~ 1 M~ 1 L A ^y = 0, (9.33) 

in which 

A =diag{A;v, ■■■,%!} (9-34) 

and A,'s are the eigenvalues of M~ l L&. Because a similarity transformation brings 
M~ l L& to the symmetric form M~?LaM~ ? , we conclude that A,'s are real and non- 
negative. Because .jV{D t ) is spanned by 1#, one of the eigenvalues of M~ l DAD T , 
say Ai, is zero and the corresponding column in & is In- Similarly to (9.23)-(9.24), 
we rewrite Sf as 

ST=\S 1 N ] (9.35) 



and note that 

x=,!7y = Sd+\ N c (9.36) 

where d £ R" _1 , and c £ R is the center of x. It then follows from (9.33) and the 
decomposition (9.36) that 

y + 3r- l M~ 1 K&y + Ay + ecos(Ot&~ 1 Ar l L A Sd = 0, (9.37) 

since l^c lies in jV(p T \ 

When the damping term K is small, the off-diagonal entries of J?~ l M~ l K& can 
be neglected [30], that is, 

Sr- x M~ l KSr w diag{*i, • ■■,k N }:=K (9.38) 
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where k, is the ith diagonal entry of Sf^M^KS? . The dynamics in (9.37) can then 
be written as 

9~'('M«)-— (c£$S)(0 **> 

( s* 

where S?~ x = ( .. 

We note from (9.39) that the dynamics of d are decoupled from that of c and 
that stability of the relative motion of the agents is determined by the cf-dynamics. 
Results for coupled Mathieu equations in [149, 96, 52] applied to (9.39) indicate 
that parametric resonance occurs around the frequencies 



and 



(0= v ' V j^k, j,k = 2,---,N. (9.40) 



2 \ 
co=-^, j = 2,---,N, r=l,2,3--- (9.41) 



For K 7^ 0, parametric resonance occurs at these frequencies if e is sufficiently large. 
The parametric resonance resulting from (9.40) is known as Combination Reso- 
nance because the excitation frequency CO is a linear combination of two natural 

frequencies JXj and y%k [149]. When (9.41) is satisfied, the corresponding mode, 
d-N-j+i, is excited and the resulting parametric resonance is called Subharmonic 
Resonance. Such resonances are well studied in structural mechanics literature and 
are not further discussed here. 



9.3.3 Fast Varying Perturbation 

In the examples above instability occurs when the frequency of the perturbation 
interferes with the natural frequencies of the cooperative system. We now show that 
if the perturbation is fast enough (i.e., large a>), the origin of (x, z) is asymptotically 
stable. In the next subsection, we investigate slow perturbations. 
Defining T/ = cot and denoting 

^ = (•)', (9.42) 

we rewrite the perturbed model in (9.32) as 

co 2 x" + coM- l Kx'+M- 1 (L A +ecosT f L A )x = 0. (9.43) 

Using the new variables z/ = z(t)/o, and Vf = x', we obtain from (9.43) that 
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V/\ _ \_(-M~ l K -M~ l D(A + ecosTf A)\ fvf 



V-i K " T )W <9M} 

S * ' 

Af(Tf) 

When co is sufficiently large, the averaging method [69] is applicable to (9.44) 
and the average of A^(Xf) is given by 

1 f 2n 
Af av = 27tJ Af (t) dt ( 9 - 45 > 



-M~ l K -M~ l DA 
D T 0/ 



I, (9-46) 



which is the system matrix of (9. 1 ) written in the coordinate of (x, z) . Therefore, A av 
is asymptotically stable. The following lemma is thus a consequence of Theorem 
B.9 in Appendix B.9: 

Lemma 9.2. Consider the closed-loop system (9.31). There exists a (Of > such 
that for CO > COf, the origin o/(x,zJ is asymptotically stable. 



9.3.4 Slowly Varying Perturbation 

To analyze the system (9.32) with slowly varying perturbation (small co), we look 
at its system matrix A s (t) in the (x, z)-coordinates: 

AM = [- M D r K -*^ + ««»«*)) . (9.47) 

Note that (x, z) is restricted to the following subspace 

S x = {(x,z)|x € R Np ,ze M{D T ®I P )}. (9.48) 

For any fixed t , if A + e cos cot A > 0(, that is 

0<£< min ^, (9.49) 

'=l,--/ O, 

it follows that the origin of (x,z) is asymptotically stable on S x , which implies that 
A s (t) restricted to S x is Hurwitz. 

We next evaluate the derivative of A s (t ) as 

A s (t)=( n ° N evfootM-iDl} 

\»£xN V£ J 

and compute its norm: 
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11^11 = f «l«)n! W A m , :x (^ A dt N m-2 DA ) 



= e(o\sm(a)t)\J Kax{AD T M- 2 DA) (9.52) 



< £a>Y / A max (ziZ) r M- 2 Z)z\). (9.53) 

Since ||A| is bounded, we conclude from Theorem B.10 in Appendix B.10 that 
for sufficiently small co or £, the origin of (x,z) of the perturbed system (9.32) is 
asymptotically stable. 

Lemma 9.3. Consider the closed-loop system (9.31). There exists a p. > such that 
for ECO < p., the origin of(x,z) is asymptotically stable. □ 



9.4 Unmodeled Dynamics 

We consider the following closed-loop system with unmodeled dynamics, i = 

mm = C& (9.54) 

e$i=A&+BiT, (9.55) 

where (9.55) represents the unmodeled dynamics, e > 0, A, is Hurwitz, and Tj is 

defined as 

t 

Ti = -ki± i -Y i dijSjZj. (9.56) 

7=1 

When £ is small, the unmodeled dynamics are fast. We further assume that the dc 
gain of the unmodeled dynamics is CiA^ l Bi = I so that the reduced model ob- 
tained by setting £ = in (9.54)-(9.55) is identical to (9.1). It then follows from 
standard singular perturbation arguments (see [69, Example 1 1.14] reviewed in Ap- 
pendix B . 1 1 ) that there exists £* such that for £ < £* , the origin of (x, z) is asymp- 
totically stable. 

To illustrate the dependence of £* on the graph and the mass inertia, we simplify 
the model in (9.54)-(9.55) by assuming M~ l K = kl p , A = 8l t , A = -I p , B = I p and 
C = I p . 

rn.xt = £ (9.57) 

e& = -£ + Tj. (9.58) 

Denoting t, = (M ®I p )%, we rewrite (9.57)-(9.58) in the compact form: 
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(9.59) 



Then, it is not difficult to show that the 3N eigenvalues of A are the roots of the 
following N characteristic polynomials: 



1 



-A, = 0, 



1,- 



,N, 



(9.60) 



where A,'s are the eigenvalues of M L, A Routh-Hurwitz argument further shows 
that the exact stability region in the parameter space is given by 



£ <£ 



k 

5Ama 



(9.61) 



where A, nax is the maximal eigenvalue of M~ l L. For sufficiently small £, (9.61) is 
satisfied and guarantees stability despite the unmodeled dynamics. Denoting m m j„ = 
min, »ii, we note that a conservative upper bound of Amax is -^— , which implies from 

(9.61) that if £ < ^p, the origin of (x,z) is stable. 

Note that, since Am ax is the maximal eigenvalue of M~ l L, e* depends not only 
on the graph structure, but also on the mass distribution of the agents. To illustrate 
this dependence, we consider four agents with k = 2, 5 = 1 and p=l. We compare 
£*'s under two graphs as in Fig. 9.2. WhenM = diag{5,3,2, 1}, we compute from 
(9.61) £* = 1.4797 for the star graph and £* = 0.8154 for the string graph, which 
means that the star graph is more robust for this M. However, when M = I4, e* = 
0.5,0.5858, respectively, for the star graph and the string graph, which implies that 
the star graph is now less robust. 



agent 3 




agent 2 agent 4 

Star Graph 



agent 2 Q 



o 



agent 1 Q 



o 



agent 3 agent 4 

String Graph 



Fig. 9.2 The two graphs used in Section 9.4 to illustrate the dependence of e* on the graph struc- 
ture and mass distribution. 
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9.5 Summary 

In this chapter, we investigated robustness of (2.74) with respect to switching topol- 
ogy, link gain variation and unmodeled dynamics. We illustrated with an example 
that switching topology can lead to instability and showed that the closed-loop sta- 
bility is maintained when switching is sufficiently fast and periodic. As a compar- 
ison, we also demonstrated that first order agreement protocols have the stability 
robustness with respect to switching topology. We next revealed a parametric res- 
onance mechanism by transforming the cooperative system with time- varying link 
gains into Mathieu equations. As in the case of switching graphs, stability is main- 
tained when the sinusoidal perturbation is slow or fast enough that it does not in- 
terfere with the natural frequencies of the group dynamics. We finally showed that 
for fast stable input unmodeled dynamics, the stability of the nominal design is pre- 
served. 

Robustness of cooperative control protocols is an area that requires further inves- 
tigation. Besides the three instability mechanisms presented in this chapter, other 
instability mechanisms should be revealed and robust redesigns need to be devel- 
oped. 



Appendix A 
Proofs 



A.l Proof of Corollary 3.2 

We consider Vf and Vt, in (2.35) and the following storage function 

v p [m) := l -m T m. (A.l) 

We obtain from (2.6) and (3.72) 

z={D T ®I p ){y + v) (A.2) 



where 

We compute Vf as 



v=[v T u ^,-y N ] T - (A3) 

V f = W T {D T ®I p ){y + v) 
= {{D®I p )y} T (y + v) 
= -u T (y + v). (A.4) 

The time derivative of V/, is given in (2.40). Using Lemma 3.1, we obtain 

V p = u T v. (A.5) 

Thus, the time derivative of V = Vf + V}, + V p yields 

V = -E^(£)-E M fv'<° (A.6) 

which proves the boundedness of (z(t) , ^ (t) , HI (t)) . We next apply the Invariance 
Principle and analyze the largest invariant set where V — 0. Applying the same 
analysis as in the proof of Theorem 3.1, we conclude from V — that t, = and 
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u = 0, which proves the convergence of (z(t), £, (?)) to S , where S is as in (2.33). In 
particular, when Property 2.1 holds, u = means Zk € ^4 and thus all trajectories 
(z(t ), £, (t)) starting in <S converge to the set sd . 

If condition 1 or 2 in Corollary 3.1 holds, u = implies z — (D T (g)I p )x = 0. From 
(3.72) and the fact that jV{p T ) is spanned by 1^, we further obtain |v,(r) — \>{t ) | — > 
for some v(f). 

If in (3.29)-(3.30), B t = B, we note that 

N N 

£v i (t)=B£tH i (A.7) 

whose time derivative is 

N N N N 

Y J Vi(t)=BY,®i = B{AY,® i +B T Y J Ui)- (A.8) 

i=\ i=l /=1 i=l 

It follows from u = -(D®I p )y and D T \ N = that Hf =l u t = (\ N ®I p ) T u = 0. 
Thus, letting % = ± Y4L1 ®U we obtain (3.73) from (3.29). Since \v t - v(t) \ -> 0, V/, 
(3.74) follows from (A.7). 



A.2 Proof of Corollary 3.3 

The proof is similar to Appendix A.l. Using the same storage functions as (2.35) 
and (A.l), we obtain 

V f = -{u + L v (y + v)) T (y + v), (A3) 

% as in (2.40) and V p as in (A.5). It follows that 

Vf + V h + V p = Y. W ^i) - L uJ yi -(y + v) T V sym (y + v) < 0. (A.10) 

ieJ* if J 

where we recall that V m = ^(L v + {L V ) T ) is positive semidefinite and has the only 
null space spanned by ljy if G v is strongly connected and balanced. 

Using the Invariance Principle, we further prove <!;— >0, y — > 0, u ^ and |v,- — 
v / 1 — *• 0. The rest of the proof mimics the proof in Appendix A. 1 . 



A.3 Proof of Lemma 4.2 

To prove Lemma 4.2 we use standard tools in multivariable calculus, and the results 
in unconstrained optimization theory. We let ji and fi be the Lipschitz constants 
of VF(x) and V 2 F (x), respectively, for x £ *2) C M. 2 . Then, the finite difference 
approximations in (4.49) and (4.50) satisfy [35, Ch.5] 
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||G»-VFfa)||i < l -y x h k (A.ll) 

\\H k -V 2 F{x k )\U < \njih k . (A.12) 

We prove Lemma 4.2 by induction. For k = 0, we first show that Ho is nonsingular. 
To this end, we let e < r, h k <h and 

S+-nh<^-, V<1, (A.13) 

3 pji 

where v < 1 is to be selected. To show that Hq is nonsingular, we employ the Banach 
Lemma given in [67, Theorem 1.2.1]. We first show that, with £ and h satisfying 
(A.13), \\V 2 F(x*)- l H -I\\ < 1 as follows: 

\\V 2 F(x*)- l H Q -I\\ = \\V 2 F( X *)-\H -V 2 F(x*))\\ 

<||V 2 F(x*)- 1 ||[||// -V 2 J F(xo)|| + ||V 2 F(xo)-V 2 F(x*)||] 

<PYi(e + -nh\ < v < 1. (A.14) 

Hence, the Banach Lemma implies 

S l-||V2F(^)-i// -/|| S 1-v' ( j 

Thus, Ho is nonsingular, and xi is well defined. Next we derive a bound on \x\ — x* \ 

as 

\x x -x*\ = |* +V G 0-**| 

< 1 1// - l 1 1 { |flb (xo - x* ) - V 2 F (x ) (x - x* ) | 
+ |VF(x ) - VF(x*) + V 2 F(xo)(x -x*)\ 
+ |Go-VF(x )|} 

" 7^;(l«72%o-*1 + f |x -x*| 2 +^ 

jS/2 /5 - e\ /3yjA 

-l^U + 2J |X °- JC| + 2(1^) 

< ^-^1 + 2^). ^ 16 > 

which further restricts v to be v < j for convergence. Note that, we want |xi — x* | < 
£ as well, hence v, h and £ must satisfy: 
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v B Yi h 

£+ -// <e=^e(l-2v)>j3yi/?. (A.17) 



1-v 2(1 -v 
We thus pick h as 



- 2e(l-2v) 

*<-^, (A.18) 



which is feasible when v < i. Then, it follows from (A. 13) that v must satisfy 



5 - ^ne(l-2v) v 

£ + -«/; = e+-5 — < — 

3 fin By 2 

^ v >AUy^ j (A .19) 

7i + f ney 2 

which is consistent with the condition v < 5 for a sufficiently small £. If h, e, and 
v are selected such that (A.13), (A.18), (A. 19) are satisfied, then 

|*i -x*\ < e=>x\ eB(i*,£) 

and (A. 14) and (A. 15) hold for Hi as well. Induction on k+ 1 goes similarly, and 
yields 

k 

\x k +i -x*\< (^ +1 \xo-x*\+h5 Y, c " ( A - 2 °) 

«=0 

where c := j^ and S := 2 (]- v ) • B ecause v < j implies c < 1, the sum in (A. 20) 



l-V """ " - — 2(1-V) • "<•"••»"<"* ' ^ 2 

J 
1- 



converges to Y*n=o c " = T=7 anc ' li m A'^~ \xk~ **| = 0(/i). 



A.4 Proof of Theorem 5.2 

We denote by ]8, the error variable 

A=A-/3 i = 2, — ,N (A.21) 

where j8 = [(/3 1 ) r , • • • , Q3 r ) T ] T and note from (5.55) that 

jg,=A ; (^(?)®/ 3 )M,- i = 2,~-,N. (A.22) 

For consistency, we set ]8i = 0. 

We then rewrite (5.10), (5.53) and (5.54) as 

l t /iA(bi+A&i x x f x l coi =-fiAcoi+ui (A.23) 

'^fjA'cQi + A'cQi x { fi 'cot = -fiA'dOi + Ui i = 2, ■ ■ -,N. (A.24) 
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To prove the stability of the closed loop system described by the adaptive scheme 
(A. 23), (A. 24), (5.18) and (5.55), we investigate the passivity properties of the in- 
terconnected system. To this end, we take the Lyapunov function 

V a = {qa-\() T {qo-\i)+q T v q v + -Am\ l /iA(Oi 

+ l ^{A'(O i f i / i A(o i + - £ ffAr% (A.25) 

i=2 i=2 

The derivative of V a along (5.7), (5.55), (A.23) and (A.24) yields 

N 
V a = -£'«, Ui+Acojui- fl\\A(Ql\\ 2 
1=1 

N N 

+ Y t {A~OH u t -fiWA'mf} + L Pf{®{t)®h)ui (A.26) 

i=2 i=2 

We further note from (5.51) and (5.52) that 

(A.27) 





(<*> 


(t) T ®b)Pi-- 


= *af - <o d 




from which we rewrite (A.26) ; 


IS 






V a = 


N T N 
;'=1 i'=l 


-/iii^f- 


-E/ill^a* 

;'=2 


= 


r N 

i=i 


H-fl\\AC0i\ 


N 

| 2 -I/,||^ 
i=2 


®i\\ 2 


= 


-/ill^H 2 - 


-L/ill^,i 

1=2 


l 2 <o, 





(A.28) 



where we used the fact Yli=\ u i = 0- The negative semidefinite derivative V a implies 
that the trajectories {qo,q v ,A(Oi,AcOi,Pi}, i = 2, ■ ■ ■ ,N are bounded. We further con- 
clude that A(D\ — > and 4o), ^ 0, i = 2,--- ,iV from Theorem B.5. By taking the 
derivative of (A.23) and (A.24), we note that Acdi and A(Oj are continuous and 
uniformly bounded because the signals {A(b\,A(Oj,A(Oi,A(Oj,u}, i — 2, ■ ■ ■ ,N are 
continuous and bounded. Thus, it follows from Theorem B.4 that A(b\ — *• and 
A(Ot — ► 0, i = 2, • • • ,7Y, which results in h, — ► from (A.23) and (A.24). Using argu- 
ments similar to those in the proof for Theorem 5.1, we conclude that q v converges 
to the null space of D <g> I?, . 

Suppose that q v — * 0. In this case, we note from (5.6) and (5.3) that D — ► (-D® 
1$) and D T — ► (— Z) r (81/3). On the other hand, since q v , qo and d) are all bounded 
and continuous signals, then from (5.7) we conclude that q\, is also bounded and 
continuous. From Theorem B.4, we obtain q v — > 0. Next we prove a> — > 0. This 
follows because <jv — > and because the Jacobian from 6) to <?v in (5.7) is always 



184 A Proofs 

full rank. Therefore, from (5.4) we conclude that co B converges to JV(D T ®Is) 
since D T — > —D T ^>I$. We further note that j¥(D t ®h) is spanned by In <£> c, 
where cgR 3 and that '©i converges to co d (t) because A(0\ — > 0. It follows that 
I'fflj— (O d {t)\ — * 0, which completes the proof. 



A.5 Proof of Proposition 8.1 

We take the following energy-motivated Lyapunov function 

V = t Pt(zt) + Ut %»*& + gM e &) (A.29) 

i=i z i=i 

where Pi{zi) is defined in (8.24), and £,- and § c are as in (8.13). From Lemma 8.1, 
the first term in (A.29) is positive definite. Then, the time derivative of V yields 

V = t(fi- fif* + 1 $™& + &M c Xc (A.30) 

i=i i=i 

From (8.2), (8.4) and the assumption 6 C = 0, the kinematics of z, are given by 

Zi = M - cii = xi - x c . (A.3 1) 

We next rewrite (A.30) from (8.9), (8.10), (8.11), (8.16) and (A.31) as 

i=i 1=1 i=i 

1=1 1=1 1=1 

i=l 1=1 

N 

= -£#*&■ <0, (A.32) 

i=i 

which implies the stability of the equilibrium S '. 

To conclude the asymptotic stability, we apply the Invariance Principle and in- 
vestigate the largest invariant set .-# where V = 0, i.e., § = 0. From (8.13), we note 
that |,- = implies that i, = v. We further obtain from q = that x\ = 0, which leads 
to Ft = fi from (8.9). Thus, it is clear from (8.16) that ff — /,. We now show that 
on M, x c = v. To see this, we note that ff = /,- implies that n = zf- Since zf is con- 
stant, we have z,i = on ^#, that is, from (8.4), i, = a,-. Because we consider only 
the translational motion and because i,- -von j%, we conclude that a,- = x c = v. 
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A.6 Proof of Proposition 8.2 

We choose the following Lyapunov function 

Vi=t p i&) + \t % m & + l*cMc*c +{t &) r V ft) (A.33) 

where § is defined in (8.27). The time derivative of V\ is given by 

Vi = - 1 iff - fifii + 1 & m & +$M c x c + t(ff~ fif*t. (A.34) 

i=l 1=1 1=1 

Note from (8.28) that 

ii = t,i+Vi-x c . (A.35) 

We then rewrite (A.34) from (8.10), (8.11), (8.29) and (A.35) as 

Vi =-t(f?-fi) T (S. + t.-*c) + ttf(- K & + f?-fi) 
/=i i=i 

+$ifi+ivf-ft) T *i 

(=i i=i 

= -f,&Kt$<0. (A.36) 

i=i 

We next apply the Invariance Principle and investigate the largest invariant set ^(* 
where Vi = 0. On ^#*, V\ = means <!;,■ = and thus 4 = 0, which further implies 
from (8.29) that fi = ff. Then, from (8.25), v,- = 0. It follows from x) = v, ■ = that 
±i is constant on j$* . Since f = ff = and ii = Xj — x c , we conclude xi — x c , Vi, 
which means that all the agents and the payload have the same constant velocity. 
Noting from (8.27) and £,- = 0, we further obtain x c = Vi, i = 1, • • • ,N. 
Next, from (8.10) and (8.25), we compute 



and 



M c x c {t) = [ V fi{s)ds + M c x c {0) (A.37) 

Jo ti 

Ut)= fAt(ff-Ms))ds + 1 t (0). 

Jo 



We rewrite (A.38) as 

AT\vi(t)-Vi(0))= fff-mds 
Jo 

and note from (8.1 1), (A.37) and (A.39) that 



(A.38) 



(A.39) 
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1=1 j ° (=1 

= -M t (i c (f)-i,(0)). (A.41) 

Because on ./#*, x c and v, are equal and constant, we obtain from (A.41) that x c = 
xi = V{ = v, where v is in (8.32). 



A.7 Proof of Corollary 8.3 

We consider the Lyapunov function below 

V a = ^Pi(zi) + \t % m & + \%M& + \ I(vO r A rl v/, (A.42) 

where § c is in (8.13), %\ = x\ — v, §j, i — 2,- ■ ■ ,N, are defined in (8.27) and v,-, 
i = 2, ■ ■ ■ ,N are as in (8.33). The time derivative of V a is computed from (8.10), 
(8.13), (8.11), (8.25), (8.28) and (8.29) as 

N 



;=1 

+cif,+i(ff-fi) T vi 



i=\ i=l 

= -£!;? K&KO (A.43) 

;=i 

which implies the stability of S* in (8.34). We perform a Lyapunov analysis similar 
to the proof in Appendix A.6 and conclude the global asymptotic stability of S* in 
(8.34). 



A.8 Proof of Theorem 9.1 

We define the new variable 

C := {S®I p )x, (A.44) 

where S is as in (9.9) and, thus, x,- — x ; — ► 0, V/,y, is equivalent to £ — *• 0. To prove 
asymptotic stability of £ = 0, we note from (9.5)-(9.6) that 

C = -(SD(t)®I p )\lf((D(t) T ®Ip)x) 
= -(QS(t)®I p )\lf((D(t) T S T ®I p )0 (A.45) 
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where we obtained the second equation by substituting 

D{t) T = D{tf S T S (A.46) 

and by using (A.44). To see that (A.46) holds, note that Zjy — S T S is an orthogonal 
projection matrix onto the span of 1#, and that 1^ is in the null space of D(t) T for 
all t > which, together, imply 

D(t) T {I N -S T S) = =► D{t) T =D{t) T S T S. (A.47) 

We then denote 

F(t):=SD(t)®I p (A.48) 

and conclude global uniform asymptotic stability for (A. 45) from Lemma A. 1 be- 
low. 

Lemma A.l. Consider the time-varying system 

C = -F(t) W (F(t) T Q (A.49) 

where £ £ W, F(t) is an n x r piecewise continuous matrix of t, and iff : W — > W 
is a locally Lipschitz nonlinearity satisfying (2.46). If F(t) satisfies ||/ 7 (?)|| < /i and 

t+S 

F{x)F{xfdx>aI (A.50) 

t 

for some constants jx, 8 and a that do not depend on t, then the origin £ = is 
globally uniformly asymptotically stable. □ 

Proof of Lemma A.l: We let 

^(f,Q:=F( ? ) T C (A.51) 

and note that the Lyapunov function 

ViiO^^Z (A.52) 

satisfies 

V 1 = -(/)V(0)=:y 1 (0)<O, (A.53) 

from which we conclude global uniform stability. To prove global uniform asymp- 
totic stability we employ the Nested Matrosov Theorem in Appendix B.5. To this 
end we introduce the auxiliary function 



V 2 (t,Q = -; 1 S(t); S(t):=J { e^F(x)F{T)'dT (A.54) 

where 

||5(?)|| <M 2 (A.55) 
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firom||F(f)|| <^i,and 

rt+8 
S(t)> e [, - x) F{x)F{T) T dT>ae- 5 I (A.56) 

from (A.50). Furthermore, S(t) = S(t) — F(t)F(t) T , from which we obtain 

V 2 <-C T S(t)C + ^ T ^-2t; T S(t)C. (A.57) 

Next, substituting (A.55)-(A.56) and |£| < /l|v(0)l obtained from (A.49), we get 

V 2 < -ae- s \C\ 2 + \<l>\ 2 + 2^\t;\\wm =■■ i2(C,0)- (A.58) 

When Fi(0) = in (A.53), it follows from (2.46) that </> = and, thus, F 2 (C. 0) = 
-a<?~ 5 |£| 2 < 0. Furthermore, Fi(0) = and F 2 (£,0) = together imply C = 0, 
which means that all conditions of the Nested Matrosov Theorem are satisfied and, 
hence, £ = is globally uniformly asymptotically stable. 



Appendix B 

Technical Tools Used in the Book 



B.l Schur Decomposition 

The Schur decomposition transforms a square matrix into an upper triangular matrix 
using a unitary similarity transformation. 

Theorem B.l. [15, Theorem 5.4.1] 

Let t£ C" x ". Then, there exist a unitary matrix Q G C" x " and an upper triangu- 
lar matrix B € C° x " such that 

L = QBQ l . (B.l) 

□ 

Because B is similar to L, B has the same eigenvalues with the same algebraic mul- 
tiplicities as L. Because B is upper triangular, these eigenvalues are the diagonal 
entries of B. 



B.2 Invariance Principle [69, Theorem 4.4] 

Theorem B.2. Let Q C D be a compact set that is positively invariant with respect 
to the following dynamical system 

* = /(*) (B.2) 

where f : D — > M" is a locally Lipschitz map from a domain D *eW into W. Let 
V : D — > K be a continuously differentiable function such that V(x) < in Q. Let E 
be the set of all points in Q where V{x) = 0. Let M be the largest invariant set in E. 
Then every solution starting in Q approaches M as t —> °°. □ 
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B.3 Barbalat's Lemma 

Theorem B.3 (Barbalat's Lemma). 

Let f{t) : R — > R be a uniformly continuous function on [0,°°). Suppose that 
lim r _^oo Jo f{s)ds exists and is finite. Then 

f(t)->0 as f-»oo. (B.3) 

n 

Barbalat's Lemma leads to the following convergence result: 

Theorem B.4. Let §(?) : R —> R be a continuous function defined on [0,°°). If 
4 it ) — * as t — » °° ancf 4 (?) w bounded, then §(?)—> as f — ► °°. □ 

Prao/ Let | (f ) be the /(?) in Theorem B.3. Then lim r ^oo J r /M^ = lim,^ 4 (f ) - 
4(0) = —^(0). Because 4 is bounded, /(f) is bounded and thus /(f) is uniformly 
continuous. The assumptions in Theorem B.3 are satisfied and it follows that fit) = 
4(f) —>0asf^oo. □ 

When applying Barbalat's Lemma to the Lyapunov analysis of nonautonomous 
systems, we have the following result: 

Theorem B.5. Suppose fit,x) is piecewise continuous in t and locally Lipschitz in 
x, uniformly in t, on [0,°°) x W. Furthermore, suppose /(f,0) is uniformly bounded 
for all f > 0. Let V : [0, °°) x R" — > R be a continuously differentiate function such 
that 

Wi (x) < V it , x) < W 2 (x) (B .4) 

. , N dV dV , s 
Vi {*) = -%+ ^/M ^ - W ( x ) ( B - 5 > 

Vf > 0, Vx £ MP, where W\ (x) and W 2 (x) are continuous positive definite functions 
and Wix) is a continuous positive semidefinite function on R". Then, all solutions 
of x = f(t,x) are bounded and satisfy 

W(x(t))->0 as t->0. (B.6) 

n 



B.4 Proposition 2.44 in [119] 

Proposition B.l (Relative degree of nonlinear passive systems). 

Consider an input-affine nonlinear system 



x = f(x)+g(x)u 
y = h{x). 



(B.7) 
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If(B.7) is passive with a C storage function S(x), then it has relative degree one at 
x = 0. □ 

The result in this proposition means that the matrix L g h(0) :— -Jp -ng(fy is 
invertible if (B.7) is passive. 



B.5 Nested Matrosov Theorem [85, Theorem 1] 

Theorem B.6 (Nested Matrosov Theorem). 

Consider a dynamical system 

x = F(t,x), xeM.". (B.8) 

For A > 0, define £i§(A ) := {x G M" |0 < \x\ < A }. Under the following assumptions, 
the origin of (B.8) is uniformly globally asymptotically stable. 

Assumption 1: The origin of (B.8) is uniformly globally stable. 

Assumption 2: There exist integers j, m > and for each A > there exist 

• a number /i > 0; 

• locally Lipschitz continuous functions Vj : R>o x R" — > M, / G { 1 , • • • , j }; 

• a function (j) : R> xR"^ R m ; 

• continuous functions Yi : W x R m — > R, i g { 1 , ■ • • , j'}; 

rac/z ?/;af, /or almost all (t,x) G M>o X ^(4 ), and all i € { 1 , • • • , y'} 

max{|V;-M,<HM)|}<M (B.9) 

?,(*,*) <y,(*,0(f,*)). (B.io) 

Assumption 3: For each integer k G {1, • • • , y'}. we have that 

A) {Yi(z, w) = 0,V( £{l,---,lc-l}, andall (z, y) G ^(4) x 3B($Cj\ 
implies that 

B) {Y k (z, W) < 0, for all (z, y) G ^(A) x SS(fL)}. 
Assumption 4: We have that the statement 

A) {Yj{z, w) = 0,V('G {1, ••-,;}, Wa/i (z,y) € ^(4) X^(jU)} 
implies that 

B) {z = 0}. □ 



B.6 Lemma 4.7 in [69] 

Consider the cascade system 

x\ = fi(t,x\,x 2 ) (B.ll) 

X2=h{t,x 2 ) (B.12) 
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where f\ : [0, oo)xi"ix R"2 -*■ R"' and f 2 : [0, °°) x M"2 -> M"2 are piecewise con- 



tinuous in t and locally Lipschitz in x 

X2 

Theorem B.7. Under the stated assumptions, if the system (B.ll), with x\ as input, 
is Input-to-State Stable (ISS) and the origin of (B.12) is globally uniformly asymp- 
totically stable, then the origin of the cascade system (B.ll) and (B.12) is globally 
uniformly asymptotically stable. □ 



B.7 Theorem 4.19 in [69] 

Theorem B.8. Consider the system 

x = f{t,x,u) (B.13) 

where f : [0, °°) x W x W" — > M." is piecewise continuous in t and locally Lipschitz 
in x and u. Let V : [0, °°) x W — > R be a continuous differentiable function such that 

ai(\x\)<V(t,x)<az(\x\) (B.14) 

dV dV 

+ f(t,x,u)<-W 3 (x), V|x|>p(|«|)>0 (B.15) 

at ox 

\/(t,x,u) £ [0,°°) xl"x W", where <X\, CUi are class K^ functions, p is a class K 
function and W3 (x) is a continuous positive definite function on W. Then the system 
(B.13) is input-to-state stable. □ 



B.8 Proposition 2 in [65] 

Proposition B.2. Assume a storage function H(x) is strictly convex. Then, for every 
x* € W, the new storage function Hq(x) defined as 

Hq{x)=H{x)-H(x*)-(x-x*) t VH(x*) (B.16) 

has a unique global minimum at x* and is proper. □ 



B.9 Theorem 10.4 in [69] 

This theorem is a combination of Theorem 10.4 in [69] and the discussion right after 
that theorem. 
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Theorem B.9. Let f(t,x,e) and its partial derivatives with respect to (x, e) up to 
the second order be continuous and bounded for (t,X, s) € [0, °°) x Dq x [0, So], far 
every compact set Dq C D, where D <zW is a domain. Suppose f is T -periodic in 
t for some T > and £ is a positive parameter. Let x(j,e) and x av (e?) denote the 
solutions of 

x = sf(t,x,e) (B.17) 

and 

x = £/avM, (B.18) 

respectively, where 

/*r(*) = 4/ /(*.*,0)dT. (B.19) 

V Jo 
T/'fAe origin x — di D is an exponentially stable equilibrium point of the average 
system (B.18) and /(f,0,e) = for all (t,£) € [0,°°) x [0,£o], f/zen f/iere exists pos- 
itive constants £* and k such that, for all < £ < £*, ffte origin is an exponentially 
stable equilibrium of (B.17). O 



B.10 Theorem 3.4.11 in [62] 

We only cite the relevant part of [62, Theorem 3.4.1 1] below. 
Theorem B.10. Consider a linear time-varying dynamical system 

x(t)=A(t)x(t), i(r)el". (B.20) 

Let the elements of A(t) in (B.20) be differentiable and bounded function of time 
and assume that /?e{A,(A(?))} < — G s Vf V; = 1, •■-,«, where A,(A(f)) is the ith 
eigenvalue ofA(t) and O s > is some constant. 

if I \A 1 1 < H is satisfied for some positive ^ and Vf > 0, then there exists a pi* > 
such that if /J S [0,jU*), the equilibrium state x e of (B.20) is globally uniformly 
asymptotically stable. □ 



B.ll Summary of Example 11.14 in [69] 

In [69, Example 11.14], the following system 

x = f(t,x,v) (B.21) 

£z=Az + Bu (B.22) 

v = Cz (B.23) 

is considered, in which /(?,0,0) = and A is a Hurwitz matrix. Assume that 
—CA~ l B = I. Setting £ in (B.22) to zero leads to the reduced model 
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x = f(t,x,u). (B.24) 

Let u = y{t,x) be a state feedback control law such that the origin of (B.24) is 
exponentially stable. Assume that / and y is sufficiently smooth. Then the origin of 
the original system (B.21)-(B.23) is exponentially stable for sufficiently small e. 

B.12 Rigid Body Attitude and Its Parameterizations 

This section presents the basic language and tools used to describe the rotational mo- 
tion of a rigid body. Recall that a body is rigid if the distance between any two points 
fixed with respect to the body remains constant. If the body is free to move and ro- 
tate in space, it has 6 degree-of-freedom (DOF), 3 translational and 3 rotational. 
When restricted to plane, the body has 3 DOF, 2 translational and 1 rotational. This 
section will mainly focus on the rotational motion of a rigid body. 

B.12.1 Rigid Body Attitude 

As shown in Fig. B.l, the attitude of a rigid body in E 3 (Euclidean 3-space) is 
completely characterized by an orthonormal frame R attached to the body. Define a 
right-handed orthonormal frame as follows: 

Definition B.l. R = (ri ri r^) is an orthonormal frame if 

1. ||rj|| = 1, i = 1,2,3 (normality) 

2. T\ ■ rj = if i ^ j, i, j = 1, 2, 3 (orthogonality) 

3. ri x Ti — r$ (right handed rule) 

where rfs are in E 3 . □ 

Given an inertial frame E, we can write the coordinates of rj's in E as r, £ R 3 , that 
is, r, = [/■,-,!, r,-,2, n.i] T ■ Then the attitude matrix R of the body in E is given by 

R=\ n n r 3 ]. (B.25) 

The attitude matrix R satisfies R T R = Ij (due to orthonormality of r,'s) and detR = 1 
(due to the right handed rule) and thus belongs to the Lie Group SO(3) (Special 
Orthogonal group of dimension 3) defined as 

SO(3)={R£M? xi \R T R = I 3 , detf?=l}. (B.26) 

The matrix R is used not only for representing the attitude of a rigid body in the 
inertial frame E, but also for rotating a vector from the inertial frame to the body 
frame or vice versa. Given a vector v £ E 3 , let its coordinates in the inertial frame 
E be ve £ K 3 - Then its coordinates in the body frame vb are given by 
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Fig. B.l The attitude of a rigid body is characterized by a frame R : 
R in an inertial frame E, we obtain an attitude matrix R as in (B.25). 



(i"i,r3,r3). If we represent 



= R T v E . 



(B.27) 



B.12.2 Parameterizations of Attitude Matrix 

Note that there are 9 parameters in the attitude matrix R, subject to 6 constraints 
imposed by the orthogonality. For manipulation, analysis and implementation, it 
is more convenient to use other representations of SO(3 ). The minimal number of 
parameters that represents SO(3) is 3 since there are 3 free parameters in the atti- 
tude matrix. We will present a class of three-parameter representations of SO(3), 
including Euler-Rodriguez parameters, Modified Gibb's vector, and vector quater- 
nion. This class of parameterizations all consists of the equivalent axis and angle 
defined by the Euler Rotation Theorem below: 

Theorem B.ll (Euler Rotation Theorem). Every rigid body rotation is equivalent 
to rotating around an equivalent axis over an angle. □ 

Let the equivalent axis of R be represented by a unit vector /z S R 3 and the corre- 
sponding rotation angle be 9. We consider three-parameter parameterizations of the 
following form: 

s = y{9)h (B.28) 

where y(-) : R — > R is a C 1 odd function. Included in this general parameterization 
(B.28) are 



a 

y(9) — tan( — ) (Euler-Rodriguez parameter, Gibb's Vector) (B.29) 



7(0) = tan(-) (Modified Gibb's Vector) 
y(8) = sin( — ) (Vector quaternion) . 



(B.30) 
(B.31) 
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Other parameterizations, such as Euler angles and roll-pitch-yaw angles, are also 
commonly used in literature. However, only the parameterization in (B.28) is dis- 
cussed in this section since it is closely related to Chapter 5. 

The representation (B.28) can be uniquely determined for almost all R E 5(9(3). 
Since (B.28) relates to the equivalent axis/angle, we first present how to obtain (h. 9) 
from a given R E 5(9(3). 

The relationship between the rotation matrix R and (h,9) is given by the Ro- 
driguez formula 

R = I 3 + sm9h+(l-cos8)(h) 2 (B.32) 

where^ is defined in (1.5). We obtain from (B.32) that 

cos0 = ^(tr(fl)-l) (B.33) 

and 

sin0 = -h T {R-R T Y (B.34) 

which implies 

9 = a .t a .n2(^h T (R~R T )\^(tr(R) - 1)) (B.35) 

where tr(R) is the trace of R and v is the inverse operation of ~ defined in (1.6). 
By manipulating the diagonal elements of R, we can find an expression for the /th 
element of h: 

where Rn denotes the (i,j)th element of R. The computation of (B.36) breaks down 
when tr(R) = 3, that is, R — Ij, which implies from (B.33) that 9 = 0. In this case, 
h can be an arbitrary unit vector. Once h is obtained, we compute 9 from (B.35). 

Note that (B.36) gives two solutions to h, which results in two solutions to (B.35). 
These two solutions correspond to the two possible direction of rotation to achieve 
the same end result. We can also verify from (B.32) that both (0,h) and (— 6,— h) 
yield the same R. 

The parameterization (B.28) can be readily obtained from h and 9. Thanks to the 
odd function y(), (6,h) and (— 8,— h) yield the same parameterization s. Thus, for 
almost all R E 50(3), one can uniquely determine a three parameterization s from 
(9,h) and (B.28). In certain cases, for example when 9 = 7rand y{9) =tan|, (B.28) 
is not well defined. This is because all three-parameter representations of S 0(3) are 
only locally one-to-one and onto. As we will present in the next section, there are 
always singularities for three-parameter representations of 5(9(3). This singularity 
issue can be eliminated by using unit quaternion, which uses the least possible num- 
ber of parameters (4) to represent the orientation globally. Besides, unit quaternion 
is of great interest due to its desirable computational properties [144]. 

A unit quaternion q consists of a scalar part qo and a vector part q v E R 3 , which 
can be defined through (9,h) as 
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e e 

qo = cos — and q v = h sin — . (B.37) 

It follows that \q\ = q\ + q^q Y — 1. Thus, q is a four-parameter representation of 
SO(3) with one constraint. Given q, the corresponding attitude matrix R can be ob- 
tained from 

R = h+2q q v + 2q v 2 . (B.38) 

It is easy to verify from (B.38) that both q and — q correspond to the same attitude 
matrix R. 

Conversely, given R <G SO(3), we obtain from (B.33) that 



/ 1 + cos 6 1 , — 

■■Ml — ;r- =±W 1 +*( R )- ( B - 39 > 

When qo ^ 0, we find 



go = cos - _ . . 

H 2 V 2 2 



R-R T ^ 



*-\-^) (B.40) 

because (B.38) implies R — R T — 4qoq v . When qo — 0, q v can be found as the unit 
eigenvector ofR corresponding to the eigenvalue 1 since Rq v = q v . 



B.13 Rigid Body Kinematics 

The rotation kinematics of rigid body describes the evolution of the attitude of a 
rigid body. Let the attitude matrix of a rigid body with respect to an inertial frame 
at time t be R(t). Using the identity R(t)R T (t) = 1$, we obtain 

R(t)R T {t)+R(t)R T (t) = (B.41) 

which implies R(t)R T (t) is skew symmetric. Defining 

(D = R(t)R T (t) (B.42) 

we get 

R = COR (B.43) 

where (0 G M 3 is defined as the angular velocity of R(t) represented in the inertial 
frame. If we denote by co B the angular velocity of R(t) represented in the body frame 
R(t), we obtain from (B.27) 

(O B =R T C0 (B.44) 

and ^ 

R = RC0 B R = RC0 B (B.45) 

where we used the fact ^ 

Rxo B =Ra B R T . (B.46) 
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Let s e 
given by 
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be a parameterization of R ESO(3). Then the time derivative of s is 



J s (s)co 



(B.47) 



where J s (s) is called the representation Jacobian matrix. If J^ ; (s) loses rank, s cannot 
be computed from (B.51). This is called a singularity of the representation s. 

We now compute the Jacobian matrices of the parameterizations in (B.28). First, 
we use (B.33) and (B.43) to obtain 



= h T co. 



We further calculate h from (B.32) as 



12 



Note from (B.28) that 



- h + cot— h 



0h + y(8)h. 



CO. 



dy 



It then follows from (B.48) and (B.49) that 

1 



de 



2 y(0)U + coA 2 



CO. 



Using the identities h 2 — hh T — h T Mt, and \h\ — 1, we rewrite (B.51) as 



|! - 1 7(0) cot |) hh T - l -y{Q)7i- ^7(0)cot^/ 3 



CO. 



(B.48) 



(B.49) 



(B.50) 



(B.51) 



(B.52) 



M>) 



Thus, s is singular whenever J s (s) is singular or unbounded. For example, when 
7(0) = sin I (i.e., s is the vector quaternion), J s (s) loses rank if cos f = 0, which 
implies that s T s — y(0) 2 h T h — 1. If s is the Gibb's vector, y(0) = tan f and J s (s) 
becomes unbounded at = it, which means that s is unbounded. For unit quaternion 
in (B.37), we compute its time derivative as 



1 



-<tv 



2 V <loh - <?v 



CO. 



(B.53) 



Mq) 



Note that J q (q) T J q (q) = 1(1 +^o)^3' which is never singular. For this reason, unit 
quaternion is a popular representation of 5(9(3). 
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B.14 Rigid Body Dynamics 

When a rigid body rotates at the angular velocity (0 in the inertial frame, the linear 
velocity of a point p on the rigid body is given by dip. Let p (p) be the density of 
the body at point p. Then the total rotational energy of this rigid body is given 

K= l -J v p{p){o) P ) T {o>p)dV (B.54) 

which is equivalent to 

K= 1 -cd t J p{p){p T ph-pp T )dV(o. (B.55) 



Defining 



we have 



p(p)(p T ph~PP T )dV, (B.56) 



v 

1 



K=-(Q 1 J r (0. (B.57) 

The matrix 2 is the inertia matrix of the rigid body represented in the inertial 
frame. It is always positive definite. If the body is rotating, 4 also changes. 

The angular momentum of the body is given by M — J? co in the inertial frame. 
The time derivative of M is the torque x acting on the body represented in the inertial 
frame, i.e., 

dM _ d(/G)) 



dt dt 



= x. (B.58) 



As the body rotates, the density p (p) does not change. Therefore, using p = S)p, we 
obtain from (B.56) that 

(j> J -/(£>. (B.59) 



dt 
It then follows that 

c /(b + aJ r (0 = T, (B.60) 

which is the Euler equation. 

We can also write (B.60) in the body frame. We use (B.44) and (B.45) in (B.60) 
and obtain 

f d{R(Q B ) 

^ dt 
which can be simplified as 



R(0 B J Rco 8 = T (B.61) 



i / b 6) b + (O b / b (O b = t b , (B.62) 

where x B — R T T is the torque x represented in the body frame and 

J B =R T JR (B.63) 
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is the inertia matrix represented in the body frame. Note that (B.60) and (B.62) are 
of the same form. 
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